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Abstract 

The Heat theorem reveals the second law of equilibrium Thermody- 
namics (i.e. existence of Entropy) as a manifestation of a general prop- 
erty of Hamiltonian Mechanics and of the Ergodic Hypothesis, valid 
for 1 as well as 10 23 degrees of freedom systems, i.e. for simple as 
well as very complex systems, and reflecting the Hamiltonian nature of 
the microscopic motion. In Nonequilibrium Thermodynamics theorems 
of comparable generality do not seem to be available. Yet it is possi- 
ble to find general, model independent, properties valid even for simple 
chaotic systems (i.e. the hyperbolic ones), which acquire special inter- 
est for large systems: the Chaotic Hypothesis leads to the Fluctuation 
Theorem which provides general properties of certain very large fluctu- 
ations and reflects the time-reversal symmetry. Implications on Fluids 
and Quantum systems are briefly hinted. The physical meaning of the 
Chaotic Hypothesis, of SRB distributions and of the Fluctuation The- 
orem is discussed in the context of their interpretation and relevance 
in terms of Coarse Grained Partitions of phase space. This review is 
written taking some care that each section and appendix is readable 
either independently of the rest or with only few cross references. 
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1 The Heat Theorem 



An important contribution of Boltzmann to Physics as well as to research 
methods in Physics has been the Heat Theorem. 

Summarizing here an intellectual development, spanning about twenty 
years of work, the Heat Theorem for systems of particles of positions q 
and momenta p, whose dynamics is modeled by a Hamiltonian of the form 
H = K(p) + W(q), K = p 2 , can be formulated as follows 

Heat Theorem: In a isolated mechanical system, time averages (F) of 
the observables, i.e. of functions F on phase space, are computable as their 
integrals with respect to probability distributions fi a which depend on the 
control parameters a determining the states. It is possible to find four ob- 
servables, whose averages can be called U,V,T,p, depending on a, so that 
an infinitesimal change da implies variations dU, dV of U, V so related that 



where p = (— dyW) and V is a(ny) parameter on which W depends, and 
U, T are the average total energy and the average total kinetic energy. 
When the system is large and V is the volume available to the particles the 
quantity p can be shown to have the interpretation of physical "pressure " on 
the walls of the available volume. 

Remarks: (a) Identification of T with the average kinetic energy had been 
for Boltzmann a starting point, assumed a priori, from the works of Kronig 
and Clausius of a few years earlier (all apparently unaware, as everybody 
else, of the works of Bernoulli, Herapath, Waterstone, [1]). 

(b) Connection with observations is made by identifying curves in parameter 
space, t — > a(t), with reversible processes. And in an infinitesimal process, 
defined by a line element da, the quantity pdV is identified with the work 
the system performs, dU with the energy variation and dQ = dU + pdV 
as the heat absorbed. Then relation Eq.(l.l) implies that Carnot machines 
have the highest efficiency. The latter is one of the forms of the second law, 
which leads to the existence of entropy as a function of state in macroscopic 
Thermodynamics, [2]. 

(c) Eq.(l.l), combined with the (independent) assumption that heat ex- 
tracted at a fixed temperature cannot be fully transformed into work, im- 
plies that in any process ^ < dS. Hence in isolated systems changing 
equilibrium state cannot make entropy decrease, or in colorful language the 
entropy of the Universe cannot decrease, [3, p. 1-44-12]. Actually by suit- 



dU + pdV 
f 



= "exact" = J dS 



(1.1) 
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ably defining what is meant by irreversible process it is possible to reach the 
conclusion that, unless the change of equilibrium state is achieved via a re- 
versible process, the entropy of an isolated system does increase strictly, [2]. 
Conceptually, however, this is an addition to the second law, [3, p. 1-44-13]. 

Examples of control parameters are simply U,V, or T,V, or p, V. The 
theorem holds under some hypotheses which evolved from 

(a) all motions are periodic (1866) 

(b) aperiodic motions can be considered periodic with infinite period (!), [4]. 

(c) motion visits all phase space of given total energy: in modern terminology 
this is the ergodic hypothesis (1868-1884), [5]. 

The guiding idea was that Eq.(l.l) would be true for all systems de- 
scribed by a Hamiltonian H = K + W: no matter whether having few or 
many degrees of freedom, as long as the ergodic hypothesis could be supposed 
true. 

In other words Eq.(l.l) should be considered as a consequence of the 
Hamiltonian nature of motions: it is true for all systems whether with one 
degree of freedom (as in the 1866 paper by Boltzmann) or with 10 19 degrees 
of freedom (as in the 1884 paper by Boltzmann). 

It is, in a sense, a property of the particular Hamiltonian structure of 
Newton's equations (Hamiltonian given as sum of kinetc plus potential en- 
ergy with kinetic energy equal to J2i ^pf an d potential energy purely po- 
sitional). True for all (ergodic) systems: trivial for 1 degree of freedom, a 
surprising curiosity for few degrees and an important law of Nature for 10 19 
degrees of freedom (as in 1 cm 3 of H2). 

The aspect of Boltzmann's approach that will be retained here is that 
some universal laws merely reflect basic properties of the equations of motion 
which may have deep consequences in large systems: the roots of the second 
Law can be found, [4], in the simple properties of the pendulum motion. 

Realizing the mechanical meaning of the second law induced the birth 
of the theory of ensembles, developed by Boltzmann between 1871 (as rec- 
ognized by Gibbs in the introduction to his treatise) and 1884, hence of 
Statistical Mechanics. 

Another example of the kind are the reciprocal relations of Onsager, 
which reflect time reversal symmetry of the Hamiltonian systems consid- 
ered above. Reciprocity relations are a first step towards understanding 
non equilibrium properties. They impose strong constraints on transport 
coefficients, i.e. on the E-derivatives of various average currents induced 
by external forces of intensities E = (£1, . . . ,E n ), which disturb the system 
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from an equilibrium state into a new stationary state. The derivation leads to 
the quantitative form of reciprocity which is expressed by the "Fluctuation- 
Dissipation Theorems", i.e. by the Green-Kubo formulae, expressing the 
transport coefficient of a current in terms of the mean square fluctuations 
of its long time averages. 

In the above Boltzmann's papers (as well as in several other of his works) 
Thermodynamics is derived on the assumption that motions are periodic, 
hence very regular: see the above mentioned ergodic hypothesis. Neverthe- 
less heat is commonly regarded as associated with the chaotic motions of 
molecules and thermal phenomena are associated with fluctuations due to 
chaotic motions at molecular level. A theme that is pursued in this paper it 
to investigate how to reconcile opposites like order and chaos within a uni- 
fied approach so general to cover not only equilibrium Statistical Mechanics, 
but many aspects of nonequilibrium stationary states. An overview is in the 
first thiteen sections, while the appendices enter into technical details, still 
keeping at a heuristic level in discusing a matter that is often given little 
conideration by Physicists because of its widespread reputation of being just 
abstract Mathematics: hopefully this will help to divulge a theory which is 
not only simple conceptually nut it seems promising of further developments. 

The above comment is meant also to explain the meaning of the title of 
this paper. 

2 Time Reversal Symmetry 

In a way transport coefficients are still equilibrium properties and nothing 
is implied by reciprocity when E is strictly ^ 0. 

It is certainly interesting to investigate whether time reversal has impor- 
tant implications in systems which are really out of equilibrium, i.e. subject 
to non conservative forces which generate currents (transporting mass, or 
charge, or heat or several of such quantities). 

There have been many attempts in this direction: it is important to quote 
the reference [6] which summarizes a series of works by a Russian school and 
completes them. In this paper an extension of the Fluctuation-Dissipation 
theorem, as a reflection of time reversal, is presented, deriving relations 
which, after having been further developed, have become known as "work 
theorems" and/or "transient fluctuation theorems" for transformations of 
systems out of equilibrium, [7, 8, 9, 10, 11, 12]. 

For definiteness it is worth recalling that a dynamical system with equa- 
tions x = f(x) in phase space, whose motions will be given by maps t — ► SfX, 
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is called "reversible" if there is a smooth (i.e. continuously differentiable) 
isometry / of phase space, anticommuting with St and involutory, i.e. 

I S t = S- t I, I 2 = 1 (2.1) 

Usually, if x = (p, q), time reversal is simply I(p, q) = (— p, q). 

The main difficulty in studying nonequilibrium statistical Mechanics is 
that, after realizing that one should first understand the properties of sta- 
tionary states, considered as natural extensions of the equilibrium states, it 
becomes clear that the microscopic description cannot be Hamiltonian. 

This is because a current arising from the action of a nonconservative 
force continuously generates "heat" in the system. Heat has to be taken out 
to allow reaching a steady state. This is empirically done by putting the 
system in contact with one or more thermostats. In models, thermostats are 
just forces which act performing work balancing, at least in average, that 
produced by the external forces, i.e. they "model heat extraction". 

It is not obvious how to model a thermostat; and any thermostat model 
is bound to be considered "unphysical" in some respects. This is not sur- 
prising, but it is expected that most models introduced to describe a given 
physical phenomenon should be "equivalent" . 

Sometimes it is claimed that the only physically meaningful thermostats 
for nonequilibrium systems (in stationary states) are made by infinite (3- 
dimensional) systems which, asymptotically at infinity, are in statistical 
equilibrium. In the latter cases it is not even necessary to introduce ad 
hoc forces to remove the heat: motion remains Hamiltonian and heat flows 
towards infinity. 

Although the latter is certainly a good and interesting model, as un- 
derlined already in [13], it should be stressed that it is mathematically in- 
tractable unless the infinite systems are "free", i.e. without internal inter- 
action other than linear, [13, 14, 15, 16, 17]. 

And one can hardly consider such assumption more physical than the 
one of finite thermostats. Furthermore it is not really clear whether a linear 
external dynamics can be faithful to Physics, as shown by the simple one 
dimensional XY-models, see [18] where a linear thermostat dynamics with 
a single temperature leads a system to a stationary state, as expected, but 
the state is not a Gibbs state (at any temperature) . The method followed in 
[18], based on [19], can be used to illustrate some problems which can arise 
when thermostats are classical free systems, see Appendix A4. 
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3 Point of view 

The restriction to finite thermostats, followed here, is not chosen because 
infinite thermostats should be considered unphysical, but rather because it 
is a fact that the recent progress in nonequilibrium theory can be traced to 

(a) the realization of the interest of restricting attention to stationary states, 
or steady states, reached under forcing (rather than discussing approach to 
equilibrium, or to stationarity) . 

(b) the simulations on steady states performed in the 80's after the essential 
role played by finite thermostats was fully realized. 

Therefore investigating finite thermostat models is still particularly im- 
portant. This makes in my view interesting to confine attention on them 
and to review their conceptual role in the developments that took place in 
the last thirty years or so. 

Finite thermostats can be modeled in several ways: but in constructing 
models it is desirable that the models keep as many features as possible 
of the dynamics of the infinite thermostats. As realized in [6, p. 452] it is 
certainly important to maintain the time reversibility. Time reversibility 
expressed by Eq. (2.1), i.e. existence of a smooth conjugation between past 
and future, is a fundamental symmetry of nature which (replaced by TCP) 
even "survives" the so called time reversal violation; hence it is desirable 
that it is saved in models. An example will be discussed later. 

Comment: (1) The second law of equilibrium Thermodynamics, stating ex- 
istence of the state function entropy, can be derived without reference to the 
microscopic dynamics by assuming that heat absorbed at a single tempera- 
ture cannot be cyclically converted into work, [2]. In statistical Mechanics 
equilibrium, states are identified with probability distributions on phase 
space: they depend on control parameters (usually two, for instance en- 
ergy and volume) and processes are identified with sequences of equilibrium 
states, i.e. as curves in the parameters space interpreted as reversible pro- 
cesses. The problem of how the situation, in which averages are represented 
by a probability distribution, develops starting from an initial configuration 
is not part of the equilibrium theory. In this context the second law arises 
as a theorem in Mechanics (subject to asssumptions) and, again, just says 
that entropy exists (the heat theorem). 

(2) As noted in Sec.l, if the scope of the theory is enlarged admitting pro- 
cesses that cannot be represented as sequences of equilibria, called "irre- 
versible processes" , then the postulate of impossibility to convert heat into 
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work extracting it from a single thermostat implies, again without involving 
microscopic dynamics, the inequality often stated as "the entropy of the 
Universe" cannot decrease in passing from an equilibrium state to another. 
And, after properly defining what is meant by irreversible process [2], ac- 
tually strictly increases if in the transformation an irreversible process is 
involved; however perhaps it is best to acknowledge explicitly that such a 
strict increase is a further assumption, [3, p. 1-44-13] leaving aside a lengthy, 
[2], and possibly not exhaustive analysis of how in detail an irreversible 
transformation looks like. Also this second statement, under suitable as- 
sumptions, can become a theorem in Mechanics, [20, 21], but here this will 
not be discussed. 

(3) Therefore studying macroscopic properties for systems out of equilibrium 
can be divided into an "easier" problem, which is the proper generalization 
of equilibrium statistical Mechanics: namely studying stationary states iden- 
tified with corresponding probability distributions yielding, by integration, 
the average values of the few observables of relevance. And the problem of 
approach to a stationary state which is of course more difficult. The recent 
progress in nonequilibrium has been spurred by restricting research to the 
easier problem. 

4 The Chaotic Hypothesis (CH) 

Following Boltzmann and Onsager we can ask whether there are general 
relations holding among time averages of selected observables and for all 
systems that can be modeled by time reversible mechanical equations x = 

/(*)■ 

The difficulty is that in presence of dissipation it is by no means clear 
which is the probability distribution fi a which provides the average values 
of observables, at given control parameters a. 

In finite thermostat models dissipation is manifested by the nonvanishing 

of the divergence, a(x) d = — J29 Xi fi(x), of the equations of motion and of 
its time average a+. 

If cr+ > 1 , it is not possible that the distributions /i a be of the form 
p a (x)dx, "absolutely continuous with respect to the phase space volume": 
since volume contracts, the probability distributions that, by integration, 
provide the averages of the observables must be concentrated on sets, "at- 
tractors" , of volume in phase space. 



x As intuition suggests <r+ cannot be < 0, [22], when motion takes place in a bounded region 
of phase space, as it is supposed here. 
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This means that there is no obvious substitute of the ergodic hypothesis: 
which, however, was essential in equilibrium statistical Mechanics to indicate 
that the "statistics" fi a , i.e. the distribution /j, a such that 



for all x except a set of zero volume, exists and is given by the Liouville 
volume (appropriately normalized to 1) on the surfaces of given energy U 
(which is therefore one of the parameters a on which the averages depend). 2 

It is well known that identifying fi a with the Liouville volume does not 
allow us to derive the values of the averages (aside from a few very simple 
cases, like the free gas): but it allows us to write the averages as explicit 
integrals, [23], which are well suited to deduce relations holding between 
certain averages, like the second law Eq.(l.l) or Onsager reciprocity and 
the more general Fluctuation Dissipation Theorems. 

The problem of finding a useful representation of the statistics of the 
stationary states in systems which are not in equilibrium arose in the more 
restricted context of fluid Mechanics earlier than in statistical Mechanics. 
And through a critique of earlier attempts, [24], in 1973 Ruelle proposed that 
one should take advantage of the empirical fact that motions of turbulent 
systems are "chaotic" and suppose that their mathematical model should 
be a "hyperbolic system" , in the same spirit in which the ergodic hypothesis 
should be regarded: namely while one would be very happy to prove ergod- 
icity because it would justify the use of Gibbs' microcanonical ensemble, real 
systems perhaps are not ergodic but behave nevertheless in much the same 
way and are well described by Gibbs' ensemble..., [25]. 

The idea has been extended in [26, 23] to nonequilibrium statistical Me- 
chanics in the form 

Chaotic hypothesis (CH): Motions on the attracting set of a chaotic 
system can be regarded as motions of a smooth transitive hyperbolic system. 3 

The hypothesis was formulated to explain the result of the experiment in 
[27]. In [26] it was remarked that the CH could be adequate for the purpose. 

2 By Liouville volume we mean the measure S(K(p) + W(q) — U)dpdq, on the manifold of 
constant energy or, in dissipative cases discussed later, the measure tipdq. 

3 Transitive means "having a dense orbit". Note that here this is a property of the attracting 
set, which is often not at all dense in the full phase space. Such systems are also called "Anosov 
systems" . 




(4.1) 
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5 "Free" implications of the Chaotic Hypothesis 

Smooth transitive hyperbolic systems share, independently of the number 
of degrees of freedom, remarkable properties, [28]. 

(1) their motions can be considered paradigmatic chaotic evolutions, whose 
theory is, nevertheless, very well understood to the point that they can play 
for chaotic motions a role alike to the one played by harmonic oscillators for 
ordered motions, [29]. 

(2) there is a unique distribution fi on phase space such that 

lim - f F{S t x)dt= [ n{dy)F(y) (5.1) 

r^oo T Jo J 

for all smooth F and for all but a zero volume set of initial data x, [30, 31, 
23, 28], see Appendix Al. The distribution fi is called the SRB probability 
distribution, see Appendix A2. 

(3) averages satisfy a large deviations rule: i.e. if the point x in / = 
^ Jq F(Stx) dt is sampled with distribution fi, then 

hm 1 log Prob^f € A) = max <>(/) (5-2) 

is an asymptotic value that controls the probability that the finite time 
average of F falls in an interval A = [u,v], u < v, subset of the interval 
(of, &f) of definition of (f- In the interval of definition CH/) i s convex and 
analytic in /, [30, 32]. Outside [a^, bp] the function Cf(7) can be defined to 
have value — oo (which means that values of / in intervals outside [of,&f] 
can possibly be observed only with a probability tending to faster than 
exponentially), [30, 32]. 

(4) A more precise form of Eq.(5.2) yields also the rate at which the limit is 
reached: Prob^f £ A) = e T max /eA <f(/)+0(i) with bounded uniformly 
in r, at fixed distance of A from the extremes ap, &f- This is ofteen written 
in a not very precise but mnemocnically convenient form, as long as its real 
meaning is kept in mind, as 

p^f) = e rC(/)+0(l) (53) 

(5) The fluctuations described by (5.2) are very large fluctuations as they 
have size of order r rather than 0{y/r): in fact if the maximum of Cf(/ ) is at 
a point fo £ (of, bp) and is a nondegenerate quadratic maximum, then Eq. 
(5.2) implies that \fr{f — fo) has an asymptotically Gaussian distribution. 
This means that the motion can be regarded to be so chaotic that the values 
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of F(Stx) are independent enough so that the finite time average deviations 
from the mean value fo are Gaussian on the scale of yfr. 

(6) A natural extension to (5.2) in which several observables Fi,...,F n 
are simultaneously considered is obtained by defining fi = \ f T Fi(Stx)dt. 
Then there exists a convex closed set C C TV 1 and function CF(f) analytic 
in f = (/i, . . . , f n ) in the interior of C and, given an open set A C C, 

^lun i log Prob^f 6 A) = max £ P (f ) (5.4) 

and Cr(f) could be defined as — oo outside C, with the meaning mentioned 
in remark (2). If the function Cf(^) attains its maximum in a point fo in 
the interior of C and the maximum is quadratic and nondegenerate, then 
the joint fluctuations of <p = \fr(f — fo) are asymptotically Gaussian, which 
means that have a probability density -^==^e _ 2 </>) w ith V a positive 
definite n x n matrix. 

(7) The probability distribution fi depends on the control parameters a of 
the initial data and therefore as a varies one obtains a collection of prob- 
ability distributions: this leads to a natural extension of the ensembles of 
equilibrium statistical Mechanics, [23]. 

(8) The most remarkable property, root of all the above, is that the SRB 
probability distribution fi, can be given a concrete formal representation, 
in spite of being a distribution concentrated on a set of zero volume, [30, 
32], see Appendix A1,A2. This raises hopes to use it to derive general 
relations between averages of observables. As in equilibrium, the averages 
with respect to \i are destined to remain not computable except, possibly, 
under approximations (aside very few exactly soluble cases): their formal 
expressions could nevertheless be used to establish general mutual relations 
and properties. 

(9) Given the importance of the existence and representability of the SRB 
distribution, Appendix Al,A2 will be entirely devoted to the formulation 
(Al) and to the physical interpretation of the derivation of its expression: 
this could be useful for readers who want to understand the technical as- 
pects of what follows, because some may find not satisfactory skipping the 
technical details even at a heuristic level. The aim of the non technical dis- 
cussion that follows, preceding the appendices, is to make it worth to invest 
some time on the technical details. 

(10) Applied to a system in equilibrium the CH implies the ergodic hypoth- 
esis so that it is a genuine extension of the latter and any results that follow 
from it will be necessarily compatible with those of equilibrium statistical 
Mechanics, [23]. 
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(11) For very simple systems the distribution fj, can be constructed explicitly 
and time averages of some observables computed. The systems are the dis- 
crete time evolutions corresponding to linear hyperbolic maps of tori, [28], 
or the continuous time geodesic motion on a surface of constant negative 
curvature. The latter systems are rigorously hyperbolic and the SRB dis- 
tribution can be effectively computed for them as well as for their small 
perturbations. 

(12) A frequent remark about the chaotic hypothesis is that it does not seem 
to keep the right viewpoint on nonequilibrium Thermodynamics. It should 
be stressed that the hypothesis is analogous to the ergodic hypothesis, which 
(as well known) cannot be taken as the foundation of equilibrium statistical 
Mechanics, even though it leads to the correct Maxwell Boltzmann statistics, 
because the latter "holds for other reasons" . Namely it holds because in most 
of phase space (measuring sizes by the Liouville measure) the few interesting 
macroscopic observables have the same value, [33], see also [20]. 

6 Paradigms of Statistical Mechanics and CH 

In relation to the last comment is useful to go back to the Heat Theorem 
of Sec.l and to a closer examination of the basic paper of Boltzmann [5], 
in which the theory of equilibrium ensembles is developed and may offer 
arguments for further meditation. The paper starts by illustrating an im- 
portant, and today almost forgotten, remark by Helmoltz showing that very 
simple systems ( "monocyclic systems" ) can be used to construct mechanical 
models of Thermodynamics: and the example chosen by Boltzmann is really 
extreme by all standards. 

He shows that the motion of a Saturn ring of mass m on Keplerian orbits 
of major semiaxis a in a gravitational field of strength g can be used to build 
a model of Thermodynamics. In the sense that one can call 

"volume" V the gravitational constant g, 
"temperature" T the average kinetic energy, 
"energy" U the energy and 

"pressure" p the average potential energy mka^ 1 , 

then one infers that by varying, at fixed eccentricity, the parameters U, V 
the relation (dU +pdV)/T = exact holds. Clearly this could be regarded as 
a curiosity, see [23, Appendix l.Al, Appendix 9. A3]. 

However Boltzmann (following Helmoltz? 4 ) took it seriously and pro- 



4 The relation between the two on this subject should be more studied. Boltzmann's paper of 
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ceeded to infer that under the ergodic hypothesis any system small or large 
provides us with a model of Thermodynamics (being "monocyclic" in the 
sense of Helmoltz): for instance he showed that the canonical ensemble ver- 
ifies exactly the second law of equilibrium Thermodynamics (in the form 
(dU +pdV)/T = exact) without any need to take thermodynamic limits, [5], 
[23]. The same could be said of the microcanonical ensemble (here, how- 
ever, he had to change "slightly" the definition of heat to make things work 
without finite size corrections). 

He realized that the Ergodic Hypothesis could not possibly account for 
the correctness of the canonical (or microcanonical) ensembles; this is clear 
at least from his (later) paper in response to Zermelo's criticism, [38]. Nor 
it could account for the observed time scales of approach to equilibrium. 
Nevertheless he called the theorem he had proved the heat theorem and 
never seemed to doubt that it provided evidence for the correctness of the 
use of the equilibrium ensembles for equilibrium statistical Mechanics. 

Hence there are two points to consider: first certain relations among 
mechanical quantities hold no matter how large is the size of the system 
and, secondly, they can be seen and tested not only in small systems, by 
direct measurements, but even in large systems, because in large systems 
such mechanical quantities acquire a macroscopic thermodynamic meaning 
and their relations are "typical" i.e. they hold in most of phase space. 

The first point has a close analogy in that the consequences of the 
Chaotic Hypothesis stem from the properties of small dimension hyperbolic 
systems (the best understood) which play here the role of Helmoltz' mono- 
cyclic systems of which Boltzmann's Saturn ring ([5]) is a special case. They 
are remarkable consequences because they provide us with parameter free 
relations (namely the Fluctuation Theorem, to be discussed below, and its 
consequences): but clearly it cannot be hoped that a theory of nonequilib- 
rium statistical Mechanics be founded solely upon them, by the same reasons 
why the validity of the second law for monocyclic systems had in principle 
no reason to imply the theory of ensembles. 

Thus what is missing are arguments similar to those used by Boltzmann 
to justify the use of ensembles, independently of the ergodic hypothesis: an 
hypothesis which in the end may appear (and still does appear to many) as 

1884, [5], is a natural follow up and completion of his earlier work [34] which followed [35, 4]. It 
seems that the four extremely long papers by Helmoltz, also dated 1884, [36, 37], might have at 
most just stimulated Boltzmann to revisit his earlier works and led him achieve the completion of 
the mechanical explanation of the second law. Certainly Boltzmann attributes a strong credit to 
Helmoltz, and one wonders if this might be partly due to the failed project that Boltzmann had 
to move to Berlin under the auspices of Helmoltz. 
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having only suggested them "by accident". The missing arguments should 
justify the CH on the basis of an extreme likelihood of its predictions in sys- 
tems that are very large and that may be not hyperbolic in the mathematical 
sense. I see no reason, now, why this should prove impossible a priori or 
in the future. See Sect. 12 for some of the difficulties that can be met in 
experiments testing the CH through its consequence discussed in Sec. 7. 

In the meantime it seems interesting to take the same philosophical view- 
point adopted by Boltzmann: not to consider a chance that all chaotic sys- 
tems share some selected, and remarkable, properties and try to see if such 
properties help us achieving a better understanding of nonequilibrium. Af- 
ter all it seems that Boltzmann himself took a rather long time to realize 
the interplay of the just mentioned two basic points behind the equilibrium 
ensembles and to propose a solution harmonizing them. "All it remains 
to do" is to explore if the hypothesis has implications more interesting or 
deeper than the few known and presented in the following. 

7 The Fluctuation Theorem (FT) 

The idea of looking into time reversibility to explain the experimental results 
of [27] is clearly expressed in the same paper. The CH allows us to use 
effectively time reversal symmetry to obtain what has been called in [26, 
39, 40] the "Fluctuation Theorem" . In fact a simple property holds for all 
transitive hyperbolic systems which admit a time reversal symmetry. 

The property deals with the key observable a(x), which is the above in- 
troduced divergence of the equations of motion, or "phase space contraction 
rate" . Assuming the average phase space contraction to be positive, <r+ > 0, 
let p = ^ Jo" a ^ tx ^ dt be the "dimensionless phase space contraction"; let 
((p) be the large deviation rate function introduced in §5, see Eq.(5.2), for 
F(x) = By time reversal symmetry the interval of analyticity of ((p) 

is centered at the origin and will be denoted (— p*,p*); furthermore p* > 1, 
because the average of p is 1. Then, [26], 

Fluctuation Theorem (FT): The probabilities of the large deviations of 
p satisfy, for all transitive time reversible hyperbolic systems, 

C(-p) = C(p)-p^+ (7-1) 

for all \p\ < p* : this will be called a "fluctuation relation", (FR). 
Remarks: 
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(1) In terms of the notation in Eq.(5.3) the FT is 



PAP) 
Pr(-P) 



= e P°+ r+0(l) 



(7.2) 



which is the form in which it is often written. 

(2) The theorem has been developed, in [26], to understand the results of 
a simulation, [27], whose Authors had correctly pointed out that the SRB 
distribution together with the time reversibility could possibly explain the 
observations. 

(3) Unfortunately the same name, introduced in [26, 39, 40] where FT has 
been proved, has been subsequently picked up and attributed to other state- 
ments, superficially related to the above FT. Enormous confusion ensued 
(and sometimes even errors), see [11, 41, 42]. A more appropriate name for 
such other, and different, statements has been suggested to be "transient 
fluctuation theorems". The above FT should be distinguished also from 
the results in [6] which were the first transient fluctuations results, later ex- 
tended and successfully applied, see [7, 8]. It is claimed that the difference 
between the above FT and the transient statements is just an exchange of 
limits: the point is that it is a nontrivial one, see counterexamples in [11], 
and assumptions are needed, which have a physical meaning; the CH is the 
simplest. 

(4) The FT theorem has been proved first for discrete time evolutions, i.e. 
for maps: in this case the averages over time are expressed by sums rather 
than by integrals. Hyperbolic maps are simpler to study than the corre- 
sponding continuous time systems, which we consider here, because smooth 
hyperbolic maps do not have a trivial Lyapunov exponent (the vanishing 
one associated with the phase space flow direction); but the techniques to 
extend the analysis to continuous time systems are the same as those de- 
veloped in [43] for proving the FT for hyperbolic flows and in this review I 
shall not distinguish between the two kinds of evolutions since the properties 
considered here do not really differ in the two cases. 

(5) The condition cr + > 0, i.e. dissipativity, is essential even to define p 
itself. When the forcing intensity E vanishes also a + — > and the FR loses 
meaning because p does. Neverheless by appropriately dividing both sides of 
Eq. (7.1) by cr+, and then taking the limit, a nontrivial limit can be found 
and it can be shown, at least heuristically, to give the Green-Kubo rela- 
tion for the "current" J d = = [44, 23], generated by the forcing, 
namely 
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dJ_ 

dE E=o 



O / (j( S rX)j{x)) E=0 dt 
^ J — CO 



l r°° 



(7.3) 



which is a general Fluctuation-Dissipation theorem. 

(6) The necessity of a bound p* in FT has attracted undue attention: it 
is obvious that it is there since a(x) is bounded, if CH holds. It also true 
that the role of p* is discussed in the paper [39], which is a formal and 
contemporary version of the earlier [26] and of part of the later [40] written 
for a different audience in mind. 

It is therefore surprising that this is sometimes ignored in the literature 
and the original papers are faulted for not mentioning this (obvious) point, 
which in any event is fully discussed in [39] . A proof which also discusses 
p* is in [45]. It is also obvious that for p > p* the function ((p) can be 
naturally set to be — oo, as commented in remark (6) to the CH in Sec.4, 
and for this reason Eq. (7.1) is often written without any restiction on p. 
This is another point whose misunderstanding has led to errors. For readers 
familiar with statistical Mechanics there is nothing misterious about p*. It 
is analogous the "close packing density" in systems with hard cores: it is 
clear that there is a well defined maximum density but its value is not always 
explicitly computable; and for hiher density many thermodynamic functions 
may be considered defined but as having an infinite value. 

Corollary: [46, 23], Under the same assumptions of FT, if F\ = 

i*2, . . . , F n are n observables of parity e% = ± under time reversal, Fi(Ix) = 

EiFi(x), the large deviations rate defined in Eq. (5.4), satisfies 



where f* = (— f±, e^fi-, ■ ■ ■ , £ n fn)> i n its domain of definition C C lZ n . 

Remark: Note that the r.h.s. of Eq.(7.4) does not depend on /b, ■■■,/«• 
The independence has been exploited in [44] to show that when the forcing 
on the system is due to several forces of respective intensities Ei,...,E s 
then by taking F\ = F<i = dE k cr(x), the Eq.(7.4) implies, setting 

jk(x) = dE k o~(x) and = (jk)^, the Green Kubo relations (hence Onsager 
reciprocity) 



Therefore FT can be regarded as an extension to a nonlinear regime of On- 
sager reciprocity and of the Fluctuation-Dissipation theorems. Such a rela- 



Cp(f) = Cr(f) 



o-+h 



(7.4) 




(7.5) 
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tion was pointed out in the context of volume preserving dynamics (hence in 
absence of dissipation), see comments in [6, p. 452] in particular. But it is not 
clear how to obtain from [6] the dissipative case results in Eq.(7.1),(7.4),(7.5) 
without the CH. 

8 Fluctuation Patterns, Onsager-Machlup Theory 

The last comment makes it natural to inquire whether there are more direct 
and physical interpretations of the FT (hence of the meaning of CH) when 
the external forcing is really different from the value (the value always 
assumed in Onsager's theory). 

The proof of the FT allows, as well, to deduce, [47], an apparently more 
general statement (closely related to a relation recently found in the theory 
of the Kraichnan model of 2-dimensional turbulence and called "multiplica- 
tive" fluctuation theorem, [48]) which can be regarded as an extension to 
nonequilibrium of the Onsager-Machlup theory of fluctuation patterns. 

Consider observables F = (iq d = &/&+, ■ ■ ■ , F n ) which have a well defined 
time reversal parity: Fi(Ix) = e^Fj(x), with ep % = ±1. Let Fi + be their 
time average (i.e. their SRB average) and let t — > <p(t) = (<pi(t), . . . , (f n (t)) 
be a smooth bounded function. Look at the probability, relative to the SRB 
distribution (i.e. in the "natural stationary state") that Fi(Stx) is <fi(t) for 
t € [—§,§]: we say that F "follows the fluctuation pattern" tp in the time 
interval t € [—§,§]■ 

No assumption on the fluctuation size, nor on the size of the forces 
keeping the system out of equilibrium, will be made. Besides the CH we 
assume, however, that the evolution is time reversible also out of equilibrium 
and that the phase space contraction rate <r+ is not zero (the results hold 
no matter how small u+ is and, appropriately interpreted, they make sense 
even if <r+ = 0, but in that case they become trivial). 

We denote £(p, <p) the large deviation function for observing in the time 

interval [— If , |] an average phase space contraction a T d = ^ Jl^/ 2 o~(Stx)dt = 
pa + and at the same time a fluctuation pattern F(Stx) = <p(t). This means 
that the probability that the dimensionless phase space contraction rate p 
is in a closed set A and F is in a closed neighborhood of an assigned i/>, 5 
denoted U^ t£ , is given by: 



J By "closed neighborhood" U^, e , e > 0, around if), we mean that \Fi(S t x) — Vi(*)l < £ for 
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exp( sup T((p,(p)\ (8.1) 

to leading order as r — > oo (i.e. the logarithm of the mentioned probability 
divided by r converges as r — > oo to sup peA vG(7 ^ e <p)). Needless to say 
p and <p have to be "possible" otherwise ( has to be set — oo, as in the FT 
case in Sec. 6, comment (6). 

Given a reversible, dissipative, transitive Anosov flow the fluctuation 
pattern t — ► <p(t) and the time reversed pattern t — ► eFf(-t) are then 
related by the following: 

Conditional reversibility relation: If F = (Fi, . . . , F n ) are n observables 
with defined time reversal parity Ep i = ±1 and if r is iarge the fluctuation 

def 

pattern ip(t) and its time reversal Iifi(t) = £FitPi{—t) will be followed with 
equal likelihood if the first is conditioned to a contraction rate p and the 
second to the opposite —p. This holds because: 

(( P ,<p)-((- P ,Iy) = x for < 

with £ introduced in Eq.(8.1) and a suitable p* > 1. 

It will appear, in Sec. 9, that the phase space contraction rate should 
be identified with a macroscopic quantity, the entropy creation rate. Then 
the last theorem can be interpreted as saying, in other words, that while 
it is very difficult, in the considered systems, to see an "anomalous" aver- 
age entropy creation rate during a time r (e.g. p = —1), it is also true 
that "that is the hardest thing to see". Once we see it, all the observables 
will behave strangely and the relative probabilities of time reversed patterns 
will become as likely as those of the corresponding direct patterns under 
"normal" average entropy creation regime. 

"A waterfall will go up, as likely as we expect to see it going down, in a 
world in which for some reason the entropy creation rate has changed sign 
during a long enough time." We can also say that the motion on an attractor 
is reversible, even in presence of dissipation, once the dissipation is fixed. 

The result in Eq.(8.2) is a "relation" rather than a theorem because, 
even in the hyperbolic cases, the precise restrictions on the "allowed" test 
functions fi(t) have not been discussed in [47] from a strict mathemati- 
cal viewpoint and it would be interesting to formulate them explicitly and 
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investigate their generality. 6 

The result can be informally stated in a only apparently stronger form, 
for \p\ < p* , and with the warnings in remark (4) preceding the analogous 
Eq.(5.3), as 



P r (for all j,andt 6 [-\t, \t] : Fj(S t x) ~ <pj(t)) = ^ pa+ T+0(1) 
P T (for all j, and t £ [-\t,\t] : F j (S t x) <Pj(-t)) 6 

where P T is the SRB probability, provided the phase space contraction a(x) 
is a function of the observables F. This is certainly the case if a is one of 
the Fi, for instance if a = F±. Here Fj(Stx) ~ means \Fj(Stx) — <Pj(i)\ 
small for t £ [—§,§]■ 

Remarks: 

(1) A relation of this type has been remarked recently in the context of the 
theory of Lagrangian trajectories in the Kraichnan flow, [48]. 

(2) One should note that in applications results like Eq.(8.3) will be used 
under the CH and therefore other errors may arise because of its approximate 
validity (the hypothesis in fact essentially states that "things go as if" the 
system was hyperbolic): they may depend on the number N of degrees of 
freedom and we do not control them except for the fact that, if present, 
their relative value should tend to as N — > oo: there may be (and there 
are) cases in which the chaotic hypotesis is not reasonable for small iV (e.g. 
systems like the Fermi-Pasta-Ulam chains) but it might be correct for large 
N. We also mention that, on the other hand, for some systems with small 
N the CH may be already regarded as valid (e.g. for the models in [49], 
[27,50]). 

(3) The proofs of FT and the corollaries are not difficult. Once their meaning 
in terms of coarse graining is understood, the a priori rather misterious 
SRB distribution \i is represented, surprisingly, as a Gibbs distribution for 
a 1-dimensional spin system, which is elementary and well understood. In 
Appendix Al,A2 some details are given about the nature of coarse graining 
and in Appendix A3 the steps of the proof of FT are illustrated. 

In conclusion the FT is a general parameterless relation valid, in time 
reversible systems, independently of the number of degrees of freedom: the 
CH allows us to consider it as a manifestation of time reversal symmetry. 



3 A sufficient condition should be that ifii(i) are bounded and smooth. 
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9 Reversible thermostats and Entropy Creation 

Recalling that kinetic theory developed soon after the time average of a 
mechanical quantity, namely kinetic energy, was understood to have the 
meaning of absolute temperature, it is tempting to consider quite important 
that, from the last three decades of research on nonequilibrium statistical 
Mechanics, an interpretation emerged of the physical meaning of the me- 
chanical quantity a = phase space contraction. 

A system in contact with thermostats can generate entropy in the sense 
that it can send amounts of heat into the thermostats thus increasing their 
entropy by the ratio of the heat to the temperature, because the thermostats 
must be considered in thermal equilibrium. 

Furthermore if phase space contraction can be identified with a physical 
quantity, accessible by means of calorimetric/thermometric measurements, 
then the FT prediction becomes relevant and observable and the CH can 
be subjected to tests, independently on the microscopic model that one may 
decide to assume, which therefore become possible also in real experiments. 

It turns out that in very general thermostat models entropy produc- 
tion rate can be identified with phase space contraction up to a "total time 
derivative": and since additive total time derivatives (as we shall see) do 
not affect the asympotic distribution of time averages, one can derive a FR 
for the entropy production (a quantity accessible to measurement) from a 
FR for phase space contraction (a quantity, in general, not accessible except 
in numerical simulations, because it requires a precise model for the system, 
as a rule not available). 

As an example, of rather general nature, consider the following one, ob- 
tained by imagining a system which is in contact with thermostats that are 
"external" to it. The particles of the system Co are enclosed in a container, 
also called Co, with elastic boundary conditions surrounded by a few ther- 
mostats which consist of particles, all of unit mass for simplicity, interacting 
with the system via short range interactions, through a portion diCo of the 
surface of Co, and subject to the constraint that the total kinetic energy of 
the Ni particles in the i-th. thermostat is K- L = ^X? = ^NiksTi. A symbolic 
illustration is in Fig.l. 
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Fig.l: Particles in Cg ("system particle") interact with the particles in the shaded regions ("ther- 
mostats particles" ) ; the latter are constrained to have a fixed total kinetic energy. 

The equations of motion will be (all masses equal for simplicity) 

mX = - 9x (Vo(X ) + £ W 0J (Xo, Xjj) + E(X ), 

i>o (9.1) 
mX» = - d Xi (UiiXJ + Wb,i(Xo, Xi)) - a.Xi 

with ai such that Ki is a constant. Here Woi is the interaction potential 
between particles in d and in Co, while Uo,Ui are the internal energies of 
the particles in Co,Cj respectively. We imagine that the energies Woj,Uj 
are due to smooth translation invariant pair potentials; repulsion from the 
boundaries of the containers will be elastic reflection. 

It is assumed, in Eq.(9.1), that there is no direct interaction between 
different thermostats: their particles interact directly only with the ones in 
Cq. Here E(Xo) denotes possibly present external positional forces stirring 
the particles in Cq. The contraints on the thermostats kinetic energies give 

cci = ,17 <— ► Ki = const d = f -Nik B Ti (9.2) 

3Nik B Ti 2 v ' 

where Qi is the work per unit time that particles outside the thermostat Cj 
(hence in Co) exercise on the particles in it, namely 

Qi= f -Xi-a^Wb.iCXo.XO (9.3) 

and it will be interpreted as the "amount of heaf Qi entering the thermostat 
Cj per unit time. 

The main feature of the model is that the thermostats are external to 
the system proper: this makes the model suitable for the study of situations 
in which no dissipation occurs in the interior of a system but it occurs only 
on the boundary. 
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The divergence — <r(X, X) of the equations of motion, which gives the 
rate of contraction of volume elements around dXdX, does not vanish and 
can be computed in the model in Fig.l; simple algebra yields, remarkably, 

<r(X,X) =e(X,X)+i?(X), 

£ (x,x)=V-%, i?(x) = V-% ^ 

where e(X, X) can be interpreted as the entropy production rate, because of 
the meaning of Q, L in Eq.(9.3). 7 

This is an interesting result because of its generality: it has implications 
for the thermostated system considered in Fig.l, for instance. It is remark- 
able that the quantity p has a simple physical interpretation: Eq.(9.1) shows 
that the functions Q a {p) and Ce(p) are identical because, since R is bounded 
by our assumption of smoothness, Eqs. (9.2) and (9.3) imply 



- f T a(S t (X,X))dt = - f T e(S t (X,X))dt + ^ (r) R{ °\ (9.5) 

T JO T Jo T 



so that 



a + = lim - f a(S t (X,X))dt = lim - F e(S t (X, X))dt = e + (9.6) 

r~*oo T Jo r^oo T Jq 

and the asmptotic distributions of 

. 1 f T a(S t (X,X)) , , c 1 r e(St(X,X)) , 

p' = - v tv ' — ildt, and of p = - — —dt (9.7) 
t Jo o+ r Jo e+ 

are the same. 

The Eq.(9.1) are time reversible (with I(X,X) = (-X,X)): then under 
the CH the large deviations rate C(p) f° r t ne observable -fj- satisfies the 
"fluctuation relation", Eq.(7.1). It also follows that the large deviations 
rate for identical to ((p), satisfies it as well. 



7 Eq.(9.4) are correct up to 0(N~ 1 ) if N = minA'j because the addends should contain 
also a factor (1 — 3^7) to be exact: for simplicity 0(l/N) corrections will be ignored here 
and in he following (their inclusion would imply trivial changes without affecting the physical 
interpretation), [51]. 
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The point is that e is measurable by "calorimetric and thermometric 
measurements", given its interpretation of entropy increase of the ther- 
mostats. Therefore the CH can be subjected to test or it can be used to 
"predict" the frequency of occurence of unlikely fluctuations. 

Comment: This is a rather general example of thermostats action, but it 
is just an example. For instance it can be generalized further by imagining 
that the system is thermostatted in its interior. A situation that arises nat- 
urally in the theory of electric conduction. In the latter case the electrons 
move across the lattice of the metal atoms and the lattice oscillations, i.e. 
the phonons, absorb or give energy. This can be modeled by adding a "in- 
ner" thermostat force — «oXj, acting on the particles in Co, which fixes the 
temperature of the electron gas. Actualy a very similar model appeared in 
the early days of Statistical Mechanics, in Drude's theory of electric conduc- 
tivity, [52]. Other examples can be found in [51]. 



10 Fluids 

The attempt to put fluids and turbulence within the context provided by 
the ideas exposed in the previous sections forces to consider cases in which 
dissipation takes place irreversibly. This leads us to a few conjectures and 
remarks. 

To bypass the obstacle due to the nonreversibility of the fluid equations 
which, therefore, seem quite far from the equations controlling the ther- 
mostated systems just considered, the following "equivalence conjecture", 
[53], has been formulated. Consider the two equations for an incompressible 
flow with velocity field u(x, t), d ■ u = 0, in periodic boundary condition for 
simplicity, 

ii + u- 9u = uAu — dp + g, 

(10.1) 

u + u • du = a(u)Au — dp + g, 

where a(u) = j ( -g u - )2 dx 1S a "Lagrange multiplier" determined so that the 

total energy £ d = J u 2 dx is exactly constant. 

Note that velocity reversal I : u(x) — ► — u(x) anticommutes, in the 
sense of Eq. (2.1), with the time evolution generated by the second equation 
(because a(Iu) = — a(u)), which means that "fluid elements" retrace their 
paths with opposite velocity. 
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Introduce the "local observables" F(u) as functions depending only upon 
finitely many Fourier components of u, i.e. on the "large scale" properties 
of the velocity field u. Then, conjecture, [54], the two equations should have 
"same large scale statistics" in the limit R — ► +oo. If fj, v and Jig denote 
the respective SRB distributions of the first and the second equations in Eq. 
(10.2), by "same statistics" as R — > oo it is meant that 

(1) if the total energy £ of the initial datum u(0) for the second equation is 
chosen equal to the average (/ u 2 dx) for the SRB distribution fi v of the 
first equation, then 

(2) the two SRB distributions fi u and fig are such that, in the limit R — > oo, 



So far only numerical tests of the conjecture, in strongly cut off 2- 
dimensional equations, have been attempted ([55]). 

An analogy with the termodynamic limit appears naturally: namely 
the Reynolds number plays the role of the volume, locality of observables 
becomes locality in k-space, and u, £ play the role of canonical temperature 
and microcanonical energy of the SRB distributions of the two different 
equations in (10.1), respectively \i v and jig. 

The analogy suggests to question whether reversibility of the second 
equation in Eq.(lO.l) can be detected. In fact to be able to see for a large 
time a viscosity opposite to the value v would be very unphysical and would 
be against the spirit of the conjecture. 

If the CH is supposed to hold it is possible to use the FT, which is a 
consequence of reversibility, to estimate the probability that, say, the value 
of a equals —v. For this purpose we have to first determine the attracting 
set. 

Assuming the K41, [53], theory of turbulence the attracting set will be 
taken to be the set of fields with Fourier components = unless |k| < Ri . 

Then the expected identity (a) = u, between the average friction (a) 
in the second of Eq.(lO.l) and the viscosity v in the first, implies that the 
divergence of the evolution in the second of Eq.(lO.l) is in average 



the difference (F) 



*0. 



a ~ v 




(10.2) 



By FT the SRB-probability to see, in motions following the second equa- 
tion in Eq. (10.2), a "wrong" average friction — v for a time r is 
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3271"^ 11 rl f 

Prob srfe ~ exp ( - rv^jjR-i ) =' e^ T (10.3) 
It can be estimated in the situation considered below for a flow in air: 

9 cm 2 cm 
i/=1.510~ 2 , u = 10. L = 100. cm 

sec sec 

i?=6.6710 4 , 5 = 3.66 10 14 sec" 1 ( 10 - 4 ) 

. P = f Prob srb = e-o* = e" 3 - 66 10 * , if r = KT 6 

where the first line are data of an example of fluid motion and the other 
two lines follow from Eq.(10.3). They show that, by FT, viscosity can be 
— v during 10~ 6 s (say) with probability P as in Eq.(10.4): unlikelyhood is 
similar in spirit to the estimates about Poincare's recurrences, [53]. 

(2) If we imagine that the particles are so many that the system can be 
well described by a macroscopic equation, like for instance the NS equation, 
then there will be two ways of computing the entropy creation rate. The 
first would be the classic one described for instance in [56], and the second 
would simply be the divergence of the microscopic equations of motion in the 
model of Fig.l, under the assumption that the motion is closely described by 
macroscopic equations for a fluid in local thermodynamic equilibrium, like 
the NS equations. This can be correct in the limit in which space and time 
are rescaled by e and e 2 and the velocity field by e, and e is small. Since 
local equilibrium is supposed, it will make sense to define a local entropy 
density s(x) and a total entropy of the fluid S = J s(x) dx. 

The evaluation of the expression for the entropy creation rate as a di- 
vergence a of the microscopic equations of motion leads to, [57], a value (e) 
with average (over a microscopically long time short with respect to the time 
scale of the fluid evolution) related to the classical entropy creation rate in 
a NS fluid as 

^-B( e ) = kB^classic + S, 

u f ( < dT ^^ 1 ' a \j (10 - 5) 

kB£dassic= J c ( K (^r) +Vj;I -dujdx 

where r' is the tensor (diUj + djUi) and r\ is the dynamic viscosity, so that 

the two expressions differ by the time derivative of an observable, which 
equals the total equilibrium entropy of the fluid S = Js(x)dx where s is 
the thermodynamical entropy density in the assumption of local equilibrium; 
see comment on additive total derivatives preceding Fig.l. 
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11 Quantum Systems 

Recent experiments deal with properties on mesoscopic and atomic scale. In 
such cases the quantum nature of the systems may not always be neglected, 
paricularly at low temperature, and the question is whether a fluctuation 
analysis parallel to the one just seen in the classical case can be performed 
in studying quantum phenomena. 

Thermostats have, usually, a macroscopic phenomenological nature: in 
a way they should be regarded as classical macroscopic objects in which 
no quantum phenomena occur. Therefore it seems natural to model them 
as such and define their temperature as the average kinetic energy of their 
constituent particles so that the question of how to define it does not arise. 

Consider the system in Fig. 1 when the quantum nature of the particles in 
Cq cannot be neglected. Suppose for simplicity (see [58]) that the nonconser- 
vative force E(Xo) acting on Co vanishes, i.e. consider the problem of heat 
flow through Cq. Let H be the operator on L2(Cq N °), space of symmetric or 
antisymmetric wave functions ^(Xq), 



H = -^A Xo + ^o(Xo) + ]T (^(Xo.Xj) + Uj(Xj) + Kj) (11.1) 

j>o 

where Ax is the Laplacian, and note that its spectrum consists of eigen- 
values E n = £^ n ({Xj}j>o), for Xj fixed (because the system in Cq has finite 
size). 

A system-reservoirs model can be the dynamical system on the space of 
the variables (\I>, ({Xj}, {Xj})j>o) defined by the equations (where (-)^ = 
expectation in the state *) 



-iMf(Xo) = (fT*)(Xo), and for j > 

*i = " (djUjiXj) + (^(XcX,-))*) - ajXj (1L2) 

n dej (Wj)^ - Uj def • , , 

j ~ 2K ' j ~ ~ j ' d i U °i( > 

here the first equation is Schrodinger's equation, the second is an equation 
of motion for the thermostats particles similar to the one in Fig.l, (whose 
notation for the particles labels is adopted here too). The model has no 
pretention of providing a physically correct representation of the motions in 
the thermostats nor of the interaction system thermostats, see comments at 
the end of this section. 
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Evolution maintains the thermostats kinetic energies Kj = ^X^ exactly 
constant, so that they will be used to define the thermostats temperatures 
Tj via Kj = ^ksTjNj, as in the classical case. 

Let no({d^}) be the formal measure on L2(Cq N °) 

nd*r(Xo)d*i(Xo))$(/ |*(Y)| 2 dY — (11.3) 

X C ° 

with ty r ,tyi real and imaginary parts of *S>. The meaning of (11.3) can be 
understood by imagining to introduce an orthonormal basis in the Hilbert 
space and to "cut it off" by retaining a large but finite number M of its 
elements, thus turning the space into a high dimensional space C M (with 
2M real dimensions) in which = d^ r (Xo) d^j(Xo) is simply interpreted 
as the normalized euclidean volume in C M . 

The formal phase space volume element ^o({d^/}) x is(dX.d%) with 

u(dX d±) d = f ]J (<5(X| - SNjksTj) dXj dXj) (11.4) 

j>0 

is conserved, by the unitary property of the wave functions evolution, just 
as in the classical case, up to the volume contraction in the thermostats, [51]. 

def 

If Qj = (Wj)^ and R is as in Eq.(9.4), then the contraction rate a 
of the volume element in Eq.(11.4) can be computed and is (again) given 
by Eq.(9.4) with e, that will be called entropy production rate: setting 

^X^E^o^it is 



*(*, X, X) = e(*, X, X) + R(X), e(V, X, X) = ^ (11.5) 

,>o ksT i 

In general solutions of Eq. (11.2) will not be quasi periodic and the Chaotic 
Hypothesis, [40, 23, 58], can be assumed: if so the dynamics should select 
an SRB distribution fj,. The distribution fi will give the statistical prop- 
erties of the stationary states reached starting the motion in a thermostat 
configuration (Xj, Xj)j>o, randomly chosen with "uniform distribution" v 
on the spheres mXj = 3NjksTj and in a random eigenstate of H. The 
distribution fx, if existing and unique, could be named the SRB distribution 
corresponding to the chaotic motions of Eq.(11.2). 

In the case of a system interacting with a single thermostat at tem- 
perature T\ the latter distribution should be equivalent to the canonical 
distribution, up to boundary terms. 
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Hence an important consistency check, for proposing Eq.(11.2) as a 
model of a thermostated quantum system, is that there should exist at least 
one stationary distribution equivalent to the canonical distribution at the 
appropriate temperature T\ associated with the (constant) kinetic energy of 
the thermostat: K\ = §fceTi N\. In the corresponding classical case this is 
an established result, [59, 23, 51]. 

A natural candidate for a stationary distribution could be to attribute 
a probability proportional to d^Sf cZXi dXi times 

oo 

£ e-^ E "5(V - tf n (Xi) e*^) dtp n <5(X? - 2KJ (11.6) 

71=1 

where /3i = l/fc^Ti, ^ are wave functions for the system in Cq, Xi,Xi 
are positions and velocities of the thermostat particles and <p n £ [0, 2ir] 
is a phase, E n = E n (X.i) is the n-th level of H(X.i), with ^ n (Xi) the 
corresponding eigenfunction. The average value of an observable O for the 
system in Co in the distribution /x in (11.6) would be 

(O)^ = Z- 1 J Tr (e-^^O) <5(Xf - 2 J fcT 1 )eZX 1 dXi (11.7) 

where Z is the integral in (11.7) with 1 replacing O, (normalization factor). 
Here one recognizes that fj, attributes to observables the average values cor- 
responding to a Gibbs state at temperature T\ with a random boundary 
condition Xi. 

However Eq.(11.6) is not invariant under the evolution Eq.(11.2) and it 
seems difficult to exhibit explicitly an invariant distribution. Therefore one 
can say that the SRB distribution for the evolution in (11.2) is equivalent 
to the Gibbs distribution at temperature T\ only as a conjecture. 

Nevertheless it is interesting to remark that under the adiabatic approx- 
imation the eigenstates of the Hamiltonian at time evolve by simply fol- 
lowing the variations of the Hamiltonian H(X.(t)) due to the motion of 
the thermostats particles, without changing quantum numbers (rather than 
evolving following the Schrodinger equation and becoming, therefore, differ- 
ent from the eigenfunctions of H(X.(t))). 

In the adiabatic limit in which the classical motion of the thermostat 
particles takes place on a time scale much slower than the quantum evolution 
of the system the distribution (11.6) is invariant. 

This can be checked by first order perturbation analysis which shows 
that, to first order in t, the variation of the energy levels (supposed non de- 
generate) is compensated by the phase space contraction in the thermostat, 
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[58]. Under time evolution, Xi changes, at time t > 0, into Xi +tXi + 0(t 2 ) 
and, assuming non degeneracy, the eigenvalue E n (X\) changes, by pertur- 
bation analysis, into E n + t e n + 0(t 2 ) with 

e n = f tfc ■ dxMy +tX 1 -d Xl U 1 = -t((W 1 ) iB + R 1 ) = -l- ai . (11.8) 

n Pi 

Hence the Gibbs factor changes by e"' 3 * 6 ™ and at the same time phase space 

contracts by e 2if i , as it follows from the expression of the divergence in 
Eq.(11.5). Therefore if (3 is chosen such that (3 = (fc^Ti)^ 1 the state with 
distribution Eq.(11.6) is stationary, (recall that for simplicity 0(1/N), see 
footnote 7 on p. 22 is neglected). This shows that, in the adiabatic approxima- 
tion, interaction with only one thermostat at temperature T\ admits at least 
one stationary state. The latter is, by construction, a Gibbs state of ther- 
modynamic equilibrium with a special kind (random Xi,Xi) of boundary 
condition and temperature T±. 

Remarks: (1) The interest of the example is to show that even in quantum 
systems the chaotic hypothesis makes sense and the intepretation of the 
phase space contraction in terms of entropy production remains unchanged. 
In general, under the chaotic hypothesis, the SRB distribution of (11.2) 
(which in presence of forcing, or of more than one thermostat is certainly 
quite not trivial, as in the classical Mechanics cases) will satisfy the fluctua- 
tion relation because the fluctuation theorem only depends on reversibility: 
so the model (11.2) might be suitable (given its chaoticity) to simulate the 
steady states of a quantum system in contact with thermostats. 

(2) It is certainly unsatisfactory that a stationary distribution cannot be 
explicitly exhibited for the single thermostat case (unless the adiabatic ap- 
proximation is invoked). However, according to the proposed extension of 
the CH, the model does have a stationary distribution which should be 
equivalent (in the sense of ensembles equivalence) to a Gibbs distribution at 
the same temperature. 

(3) The non quantum nature of the thermostat considered here and the spe- 
cific choice of the interaction term between system and thermostats should 
not be important: the very notion of thermostat for a quantum system is 
not at all well defined and it is natural to think that in the end a thermostat 
is realized by interaction with a reservoir where quantum effects are not im- 
portant. Therefore what the analysis really suggests is that in experiments 
in which really microscopic systems are studied the heat exchanges of the 
system with the external world should fulfill a FR. 
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(4) The conjecture can probably be tested with present day technology. If 
verified it could be used to develop a "Fluctuation Thermometer" to perform 
temperature measurements which are device independent in the same sense 
in which the gas thermometers are device independent (i.e. do not require 
"calibration" of a scale and "comparison" procedures). 
Consider a system in a stationary state, and imagine inducing small currents 
and measuring the average heat output rate Q+ and the fluctuations in the 
finite time average heat output rate, generated by inducing small currents, 
i.e. fluctuations of p = ^ Jq ^j^-dt obtaining the rate function of ((p). 

Then it becomes possible to read from the slope of ((p) — ((— p), equal to 
by the FR, directly the inverse temperature that the thermostat in contact 
with the system has: this could be useful particularly in very small systems 
(classical or quantum) . The idea is inspired by a similar earlier proposal for 
using fluctuation measurements to define temperature in spin glasses, [60], 
[61, p.216]. 

12 Experiments ? 

The (partial) test of the chaotic hypothesis via its implication on large fluc- 
tuations probabilities (i.e. the fluctuation relation) is quite difficult. The 
main reason is that if the forcing is small the relation degenerates (because 
e + — > 0) and it can be shown, [44], that to lowest nontrivial order in the size 
of the forcing it reduces to the Green-Kubo formula, which is (believed to 
be) well established so that the fluctuation relation will not be significant, 
being "true for other reasons", [56]. See Sec. 3. 

Hence one has to consider large forcing. However, under large forcing, 
large fluctuations of p become very rare, hence their statistics is difficult 
to observe. Furthermore the statistics seems to remain Gaussian for p, in 
a region around p = 1 where the data can be considered reliably unbiased 
(see below), and until rather large values of the forcing field or values of 
\p— 1| large compared to the root mean square deviation ^= = ((p — l) 2 ) 1 / 2 

are reached. Hence ((p) = —^yi(p — l) 2 and linearity in p of ((p) — ((—p) 
is trivial. Nevertheless, in this regime, it follows that -^7 = 0+ which is a 
nontrivial relation and therefore a simple test of the fluctuation relation. 

The FR was empirically observed first in such a situation in [27], in a 
simulation, and the first dedicated tests, after recognizing its link with the 
CH, were still performed in a Gaussian regime, so that they were really only 
tests of = o+ and of the Gaussian nature of the observed fluctuations. 

Of course in simulations the forcing can be pushed to "arbitrarily large" 
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values so that the fluctuation relation can, in principle, be tested in a regime 
in which ((p) is sensibly non Gaussian, see [62]. But far more interesting 
will be cases in which the distribution ((p) is sensibly not Gaussian and 
which deal with laboratory experiments rather than simulations. Skepticism 
towards the CH is mainly based on the supposed non measurability of the 
function ((p) in the large deviation domain (i.e. \p — 1| S> \J ((p — l) 2 ))- 
In experimental tests several other matters are worrysome, among which: 

(a) is reversibility realized? This is a rather stringent and difficult point 
to understand on a case by case basis, because irreversibility creeps in, in- 
evitably, in dissipative phenomena. 

(b) is it allowed to consider R, i.e. the "entropy production remainder" in 
(9.3), bounded? if not there will be corrections to FR to study (which in 
some cases, [63, 64], can be studied quite in detail). 

(c) does one introduce any bias in the attempts to see statistically large 
deviations? for instance in trying to take r large one may be forced to look 
at a restricted class of motions, typically the ones that remain observable 
for so long a time. It is easy to imagine that motions observed by optical 
means, for instance, will remain within the field of the camera only for a 
characteristic time tq so that any statistics on motions that are observed for 
times r > ro will be biased (for it would deal with untypical events). 

(d) chaotic motions may occur under influence of stochastic perturbations, so 
that extensions of FT to stochastic systems may need to be considered. This 
is not really a problem because a random perturbation can be imagined as 
generated by coupling of the system to another dynamical system (which, 
for instance, in simulations would be the random number generator from 
which the noise is drawn), nevertheless it demands careful analysis, [65]. 

(e) Nonconvex shape of ((p), at \p — 1| beyond the root mean square de- 
viation, see Fig.3, is seen often, possibly always, in the experiments that 
have been attempted to study large deviations. Therefore the interpreta- 
tion of the nonconvexity, via well understood corrections to FR, seems to 
be a forced path towards a full test of the FR, beyond the Gaussian regime, 
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[64]. 
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Fig. 3: An histogram of logP T (p), taken from the data of [66] at time r — IOtc — 200ms: it 
shows the rather typical nonconvcxity for \p — 1|~8 which is of the order of standard deviation. 

All the above questions arise in the recent experiment by Bandi-Cres- 
sman-Goldburg, [66]. It encounters all the related difficulties and to some 
extent provides the first evidence for the FR (hence the CH) in a system 
in which the predictions of the FR are not the result of a theoretical model 
which can be solved exactly. The interpretation of the results is difficult and 
further investigations are under way. 

The experiment outcome is not incompatible with FR and, in any event, 
it proves that good statistics can be obtained for fluctuations that extend 
quite far beyond the root mean square deviation of p — 1: an asset of the 
results in view of more refined experiments. 

A very promising field for experimental tests of the CH and the FR is 
granular materials: in granular materials collisions are not elastic, never- 
theless an experiment is proposed in [67]. See comment (6) in Sec. 13 and 
comment (4) to Eq. (11.8) for other hints at possible experiments and ap- 
plications. 

13 Comments 

(1) In the context of the finite thermostats approach, besides systems of 
particles subject to deterministic evolution, stochastically evolving systems 
can be considered and the FT can be extended to cover the new situations, 
[68, 69, 70, 48, 65]. 

(2) Alternative quantum models have also been considered in the literature, 
[71] (stochastic Langevin thermostats), or infinite thermostats (free and in- 
teracting, and possibly with further noise sources) [13, 72, 17, 16, 73]. 

(3) Many simulations have been performed, starting with the experiment 
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which showed data that inspired the FT, [27], and continuing after the proof 
of FT and the formulation of the CH, e.g. [50]: a few had the purpose of 
testing the Fr in a nongaussian regime for the fluctuations of the variable 
p, [62]. In some cases the results had to be examined closely to understand 
what was considered at discrepancy with the FT, [64], (and was not). 

(4) The physical relevance of the particular quantum thermostat model re- 
mains an open question and essentially depends on the conjecture that the 
(unknown) SRB distribution for the model in the single thermostat case is 
equivalent to the Gibbs distribution at the same temperature (a property 
valid in the corresponding classical cases). Hence the main interest of the 
model is that it shows that a FR is in principle possible in finite thermostated 
quantum systems in stationary state. 

(5) Few experiments have so far been performed (besides numerical simu- 
lations) to investigate CH and FT: extensions to randomly forced systems 
are possible, [68, 69, 70], and can be applied to systems that can be studied 
in laboratory, [74, 66]: the first experiment designed to test the FR in a 
laboratory experiment is the recent work [66]. The results are consistent 
with the FR and indicate a promising direction of research. 

(6) An interesting consequence of the FT is that 

(e- AS/kB ) S r b =(e~ IoTEj>0 ^dt) srb = O(l) (13.1) 

in the sense that the logarithms of both sides divided by r agree in the limit 
r — > oo (i.e. lim T ^ +00 ^ log ( e AS / kB ) = 0) with corrections of order O(^). 
This has been pointed out by Bonetto, see [23], and could have applications 
in the same biophysics contexts in which the work theorems, [7, 8], have 
been applied: for instance one could study stationary heat exchanges is sys- 
tems out of equilibrium (rather than measure free energy differences between 
equilibrium states at the same temperature as in [7, 8]). The boundedness 
of the l.h.s. of Eq. (13.1) implied by (13.1) can be used to test whether some 
heat emissions have gone undetected (which would imply that the l.h.s. of 
Eq.(13.1) tends to 0, rather than staying of O(l)). This is particularly rel- 
evant as in biophysics one often studies systems in stationary states while 
actively busy at exchanging heat with the sourroundings. 

(7) Another property, which is not as well known as it deserves, is that for 
hyperbolic systems, and by the Chaotic Hypothesis of Sec. 2, virtually for 
all chaotic evolutions, it is possible to develop a rigorous theory of coarse 
graining, [75, 12]. It leads to interpreting the SRB distributions as uniform 
distributions on the attractor; hence to a variational principle and to the 
existence of a Lyapunov function describing the approach to the stationary 
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state, i.e. giving a measure of the distance from it, [21, 57]. 
However it also seems to lead to the conclusion that entropy of a stationary 
state cannot be defined if one requires that it should have properties closely 
analogous to the equilibrium entropy. For instance once coarse graining has 
been properly introduced, it is tempting to define the entropy of a stationary 
state as ks times the logarithm of the number of "microcells" into which 
the attractor is decomposed, see Appendix A1,A2. 

This quantity can be used as a Lyapunov function, see [57], but it depends 
on the size of the microcells in a nontrivial way: changing their size, the 
variation of the so defined entropy does not change by an additive constant 
depending only on the scale of the coarse graining (at difference with respect 
to the equilibrium case), but by a quantity that depends also on the control 
parameters (e.g. temperature, volume etc. ), [21]. 

Given the interest of coarse graining, in Appendix Al mathematical details 
about it are discussed in the context of the SRB distribution and CH; and 
a physical interpretation is presented in Appendix A2; hopefully they will 
also clarify the physical meaning of the two. 

(8) Finally it is often said that the FR should hold always or, if not, it is 
incorrect. In this respect it has to be stressed that the key assumption is 
the CH, which implies the FR exactly in time reversible situations. However 
it is clear that CH is an idealization and the correct attitude is to interpret 
deviations from FR in terms of corrections to the CH. For instance: 

CH implies exponential decay of time correlations. But in some cases there 
are physical reasons for long range time correlations. 

Or the CH implies that observables have values in a finite range. But there 
are cases in which phase space is not bounded and observables can take 
unbounded values (or such for practical purposes). 
Time reversal is necessary. But there are cases in which it is violated. 
The pdf of p should be log-convex: but it is seldom so. 

What is interesting is that it appears that starting from CH and examin- 
ing the features responsible for its violations it may be possible to compute 
even quantitatively the corrections to FR. Examples of such corrections 
already exist, [63, 64, 76]. It would be interesting to have a concrete exper- 
iment, designed to test FR and try to understand the observed deviations; 
the BCG experiment in Sec. 12 offers, if further developed, the possibility 
of simple tests making use the existing experimental apparatus and of the 
observations that it has proved to be accessible. 
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14 Al: Coarse Graining, SRB and ID Ising Mod- 
els 

In equilibrium phase space volume is conserved and it is natural to imagine 
it divided into tiny "cells", in which all observables of interest are constant. 
The equilibrium distribution can be constructed simply by imagining to 
have divided phase space E ("energy surface") into cells of equal Liouville 
volume, small enough so that every interesting physical observable F is 
constant in each cell. Then the dynamics is a cyclic permutation of the cells 
(ergodic hypothesis) so that the stationary distribution is just the volume 
distribution. 

In a way, this is an "accident", based on what appears to be a funda- 
mentally incorrect premise, which leads to various difficulties as it is often 
considered in the context of attempts to put on firm grounds the notion 
of a "coarse grained" description of the dynamics. Confusion is also added 
by the simulations: the latter are sometimes interpreted as de facto coarse 
grained descriptions. It seems, however, essential to distinguish between 
coarse graining and representation of the dynamics as a permutation of 
small but finite cells. 

Undoubtedly dynamics can be represented by a permutation of small 
phase space volumes, as any simulation program effectively does. But it is 
also clear that the cells used in the simulations are far too small (i.e. of the 
size determined by the computer resolution, typically of double precision 
reals) to be identified with the coarse cells employed in phenomenological 
studies of statistical Mechanics. 

On the other hand if coarse grain cells are introduced which are not as 
tiny as needed in simulations the dynamics will deform them to an extent 
that after a short time it will no longer be possible to identify which cell has 
become which other cell! And this applies even to equilibrium states. 

In this respect it looks as an accident the fact that, nevertheless, at least 
in equilibrium a coarse grained representation of time evolution appears 
possible. And easily so, with small cells subject to the only condition of 
having equal volume; but the huge amount of literature on attempts at 
establishing a theory of coarse graining did not lead to a precise notion, nor 
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to any agreement between different proposals. 

Under the CH systems are hyperbolic and a precise analysis of coarse 
graining seems doable, see [21, 29] and [77]. The key is that it is possible to 
distinguish between "microcells" , so tiny that evolution is well approximated 
by a permutation on them, and "cells" which are still so small that the (few) 
interesting observables have constant value on them. The latter cells can be 
identified with "coarse grain cells" ; yet they are very large compared to the 
microcells and time evolution cannot be represented as their permutation. 
Neither in equilibrium nor out of equilibrium. 

That SRB distribution cannot be considered a permutation of naively 
defined coarse cells seems to be well known and to have been considered 
a drawback of the SRB distributions: it partly accounts for the skepticism 
that often, still now, accompanies them. 

The point that will be made, see the review [77], is that hyperbolicity 
provides us with a natural definition of coarse grained cells. At the same 
time it tells us which is the weight to be given to each cell which, in turn, 
implies that each cell can be imagined containing many "microcells" whose 
evolution is a simple permutation of them (just as in numerical simulations). 

In this appendix we consider for simplicity discrete time systems: in this 
case hyperbolic systems are described by a smooth map S, transitive and 
smoothly invertible, with the property that every phase space point x is 
a "saddle point". Out of x emerge the stable and the unstable manifolds 
W s (x), W u (x) of complementary dimension. The expansion and contrac- 
tion that take place near every point x can be captured by the matrices 
dS u (x), dS s (x) obtained by restricting the matrix (Jacobian matrix) dS(x), 
of the derivatives of S, to its action on the vectors tangent to the unsta- 
ble and stable manifolds through x: the evolution S maps W u (x),W s (x) 
to W u (Sx), W s (Sx), respectively, and its derivative (i.e. its linearization) 
maps tangent vectors at x into tangent vectors at Sx. 

A quantitative expression of the expansion and contraction is given by 
the "local expansion" or "local contraction" rates defined by 



A?(x) = f \og\det(dS) u (x)\, A? (a) = f - log | det(05) s (s)|. (14.1) 

Since time is now discrete, phase space contraction is now defined as o~(x) = 
— log | det(dS)| and related to A"(x), Af (x) by 

a(x) = -A?(x) + AfOr) - log (14.2) 
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where S(x) is the angle (in the metric chosen in phase space) between 
W s (x), W u {x) (which is bounded away from and ir by the smoothness 
of the hyperbolic evolution S). 

This suggests to imagine constructing a partition V of phase space into 
closed regions V = (P±, . . . ,P m ) with pairwise disjoint interiors, each of 
which is a "rectangle" defined as follows. 

The rectangle Pi, see the following Fig. 5 for a visual guide, has a center 
Ki out of which emerge portions C C W s (k,i),D C W u (ni) of its stable 
and unstable manifolds, small compared to their curvature, which form the 
"axes" of Pi, see Fig.5. The set Pi, then, consists of the points x obtained 
by taking a point p in the axis D and a point q in the axis C and setting 
x=W s {p) n W u (q), just as in an ordinary rectangle a point is determined 
by the intersection of the lines through any two points on the axes and 
perpendicular to them, see Fig.5. The symbol = means that x is the point 
closest to p and to q along paths in W s (p) and, respectively W u (q). s 

Note that in a rectangle anyone of its points k, could be the center in 
the above sense with a proper choice of C, D, so that K; t does not play a 
special role and essentially serves as a label identifying the rectangle. In 
dimension higher than 2 the rectangles may (and will) have rather rough 
(non differentiable) boundaries, [78]. 




Fig.5: A rectangle P with a pair of axes C, D crossing at the corresponding center re. 

It is a key property of hyperbolicity (hence of systems for which the CH 
can be assumed) that the partition V can be built to enjoy of a very special 
property. 

def 

Consider the sequence, history of x, £(x) = {£,i}'?2 = _ 00 of symbols telling 
into which of the sets of V the point S l x is, i.e. where x is found at time i, 
or S l x £ . This is unambiguous aside from the zero volume set B of the 

8 This proviso is needed because often, and certainly in transitive hyperbolic maps, the full 
manifolds W s (p), W u (q) are dense in phase space and intersect infinitely many times, [30, 32], 
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points that in their evolution fall on the common boundary of two P^s. 

Define the matrix Q to be Q^g = 0, unless there is an interior point in 
P^ whose image is in the interior of P^r. and in the latter case set Q^g = 1- 
Then the history of a point x, which in its evolution does not visit a boundary 
common to two P{s, must be a sequence £ verifying the property, called 
compatibility, that, Q( k ,$ tk+1 = 1 for all times k. 

The matrix Q tells us which sets P^> can be reached from points in P% in 
one time step. Then transitive hyperbolic maps admit a partition (in fact 
infinitely many) of phase space into rectangles V = (Pi, . . . , P m ), so that 

(1) if £ is a compatible sequence then there is a point x such that S k x 6 P^ k , 
see (for instance) Ch. 9 in [23], ("compatibility"). The points x outside the 
exceptional set B (of zero volume) determine uniquely the corresponding 
sequence £. 

(2) the diameter of the set of points E(£_i T , . . . , £it) consisting of all points 
which between time —\T and \T visit, in their evolution, the sets P^ is 
bounded above by ce~ c ' T for some c, d > (i.e. the code £ — > x determines 
x "wii/i exponential precision'). 

(3) there is a power k of Q such that > for all ("transitivity"). 

Hence points x can be identified with sequences of symbols £ verifying 
the compatibility property and the sequences of symbols determine, with 
exponential rapidity, the point x which they represent. 

The partitions V are called Markov partitions. Existence of V is non- 
trivial and rests on the chaoticity of motions: because the compatibility of 
all successive pairs implies that the full sequence is actually the history of a 
point (a clearly false statement for general partitions). 9 

If the map S has a time reversal symmetry I (i.e a smooth involution 
/, such that IS = 6'~ 1 /, see Eq.(2.1)) the partition V can be so built that 

IV = V, hence I Pi = Pi a) for some I(i). This is done simply by replacing 

V by the finer partition whose elements are Pi n IPj, because if V \V\ and 
T>2 are Markovian partitions also the partition IV is such, as well as the 
partition V\ V V2 formed by intersecting all pairs P G V\ , P' G V2 (this is 
best seen from the geometric interpretation in footonote 9 and from the time 

9 The Markovian property has a geometrical meaning: imagine each Pi as the "stack" of the 
connected unstable manifolds portions 5(x), intersections of Pi with the unstable manifolds of its 
points x, which will be called unstable "layers" in Pi. Then if Qij = 1, the expanding layers in 
each Pi expand under the action of S and their images fully cover the layers of Pj which they 
touch. Formally let Pi £ V and x S Pi, 5(x)=Pi n W n (x): the if Qij = 1, i.e. if SPi visits Pj, 
it is 8(Sx) C S8(x). 
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reversal property that IW u (x) = W s (Ix)). 

A Markov partition such that IV = V is called "reversible" and histories 
on it have the simple property that (£(7x))j = (^(x))_j^y 

Markov partitions, when existing, allow us to think of the phase space 
points as the configurations of a "1-dimensional spin system", i.e. as se- 
quences of finitely many symbols £ € {1,2,..., m} subject to the "hard core" 
constraint that Q^^ i+1 = 1- Hence probability distributions on phase space 
which give probability to the boundaries of the elements of the Markov 
partitions (where history may be ambiguous) can be regarded as stochastic 
processes on the configurations of a 1-dimensional Ising model (with finite 
spin m), and functions on phase space can be regarded as functions on the 
space of compatible sequences. 10 

The remarkable discovery, see reviews in [30, 32], is that the SRB dis- 
tribution not only can be regarded as a stochastic processes, but it is a 
short range Gibbs distribution if considered as a probability on the space of 
the compatible symbolic sequences £ on V, and with a potential function 
A($) = -A^(x(£)), see below and [28]. 

The sequences £ are therefore much more natural, given the dynamics S, 
than the sequence of decimal digits that are normally used to identify the 
points x via their cartesian coordinates. 11 

Definition: (Coarse graining,) Given a Markovian partition V let V T be the 

finer partition of phase space into sets of the form 

T/2 

= E^_ T/2 ,..., iT/2 = f S k P ik . (14.3) 

-T/2 

The sets E^ will be called "elements of a description of the microscopic 
states coarse grained to scale j" if 7 is the largest linear dimension of the 
nonempty sets . The elements E^ of the "coarse grained partition V T of 

10 It is worth also stressing that the ambiguity of the histories for the points which visit the 
boundaries of the sets of a Markovian partition is very familiar in the decimal representation 
of coordinates: it corresponds to the ambiguity in representing a decimal number as ending in 
infinitely many 0's or in infinitely many 9's. 

11 If the phase space points are considered as sequences £ then the dynamics becomes a "trivial" 
left shift of histories. This happens always in symbolic dynamics, but in general it is of little interest 
unless compatibility can be decided by a "hard core condition" involving only nearest neighbors 
(in general compatibility is a global condition involving all symbols, i.e. as a hard core it is one 
with infinite range). Furthermore also the statistics of the motion becomes very well understood, 
because short range ID Gibbs distributions are elementary and well understood. 
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phase space " are labeled by a finite string 

Z = (£-r/2,-",6r/2) (14-4) 
with & = 1, . . . , m and Q^ i+1 = 1. 

Define the forward and backward expansion and contraction rates as 



±T/2 ±T/2 

Ul!2{x) = £ AftS's), E#£(s) = £ K^x) (14.5) 
j=0 j=0 

and select a point £ £^ for each £. Then the SRB distribution hsrb 
and the volume distribution fj,L on the phase space f2, which we suppose to 
have Liouville volume, footnote p. 9, V(Q), attribute to the nonempty sets 
the respective probabilities \i and 



= Vsrb(Es) and respectively ^(^Y^t (14.6) 

if V{E) denotes the Liouville volume of E. The distributions fJ>,[iL are 
shown, [28, 23], to be defined by 

MO=^u(0- C ( -<- wt)) -<- ( " W)) (147) 

„ £ (£) = hl u{$ ) . e (C«(«))-CW«))) 

where /«(£) G is the center of P^ and h![ u (£), hg U (£) are suitable func- 
tions of £, uniformly bounded as £ and T vary and which are mildly depen- 
dent on £; so that they can be regarded as constants for the purpose of the 
present discussion, cfr. Ch. 9 in [23]. 

If 7 is a scale below which all interesting observables are (for practical 
purposes) constant, then choosing T = 0(log7~ 1 ) the sets E$ are a coarse 
graining of phase space suitable for computing time averages as weighted 
sums over the elements of the partition. 

And both in equilibrium and out of equilibrium the SRB distribution 
will not attribute equal weight to the sets E^. The weight will be instead 
proportional to e y ~ Uu ~ ( K (^~ u u,+ ( K (€))) ; j e ^ Q ^he inverse of the exponential 
of the expansion rate of the map S T along the unstable manifold and as a 
map of S~ 2 to S 2 /«(£). The more unstable the cells are the less weight 
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they have. Given Eq. (14.7) the connection with the Gibbs state with 
potential energy A(£) = A"(£) appears, see [28, Sec. 4. 3 and Ch. 5,6]. 

The sets E$ represent macroscopic states, being just small enough so that 
the physically interesting observables have a constant value within them; 
and we would like to think that they provide us with a model for a "coarse 
grained' description of the microscopic states. The notion of coarse graining 
is, here, precise and, nevertheless, quite flexible because it contains a free 
"resolution parameter" 7. Should one decide that the resolution 7 is not 
good enough because one wants to study the system with higher accuracy 
then one simply chooses a smaller 7 (and, correspondingly, a larger T). 

15 A2: SRB and Coarse Graining: a physicist's 
view 

How can the analysis of Appendix Al be reconciled with the numerical 
simulations, and with the naive view of motion, as a permutation of cells? 
The phase space volume will generally contract with time: yet we want to 
describe the evolution in terms of an evolution permuting microscopic states. 
Also because this would allow us to count the microscopic states relevant 
for a given stationary state of the system and possibly lead to extending to 
stationary nonequilibria Boltzmann's definition of entropy. 

Therefore we divide phase space into equal parallelepipedal microcells A 
of side size e <C 7 and try to discuss time evolution in terms of them: we 
shall call such cells "microscopic" cells, as we do not associate them with 
any particular observable; they represent the highest microscopic resolution. 

The new microcells should be considered as realizations of objects alike 
to those arising in computer simulations: in simulations the cells A are the 
"digitally represented" points with coordinates given by a set of integers and 
the evolution S is a program or code simulating the solution of equations 
of motion suitable for the model under study. The code operates exactly 
on the coordinates (the deterministic round offs, enforced by the particular 
computer hardware and software, should be considered part of the program). 

The simulation will produce (generically) a chaotic evolution "for all 
practical purposes", i.e. 

(1) if we only look at "macroscopic observables" which are constant on the 
coarse graining scale 7 = e~2 XT £ of the partition V T , where £0 is the phase 
space size and A > is the least contractive line element exponent (which 
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therefore fixes the scale of the coarse graining, by the last definition); 12 and 
(2) if we look at phenomena on time scales far shorter than the recurrence 
times (always finite in finite representations of motion, but of size usually 
so long to make the recurrence phenomenon irrelevant). 13 
The latter conclusion can be reached by realizing that 

(a) there has to be a small enough division into microcells that allows us 
to describe evolution as a map (otherwise numerical simulations would not 
make sense); 

(b) however the evolution map cannot be, in general, a permutation. In 
simulations it will happen, essentially always, that it (i.e. the software pro- 
gram) will send two distinct microcells into the same one. It does certainly 
happen in nonequilibrium systems in which phase space contracts in the 
average; 14 

(c) even though the map will not be one-to-one, nevertheless it will be 
such eventually: because any map on a finite space is a permutation of 
the points which are recurrent. This set is the attractor of the motions, 
that we call A and which will be imagined as a the collection of microcells 
approximating the unstable manifold and intersecting it. All such microcells 
will be considered taking part in the permutation: but this is not an innocent 
assumption and in the end is the reason why the SRB is unique, see below. 

(d) every permutation can be decomposed into cycles: each cycle will visit 
each coarse cell with the same frequency (unless there are more than one 
stationary distributions describing the asymptotics of a set of microcells 
initially distributed uniformly, a case that we exclude because of the transi- 
tivity assumption). Hence it is not restrictive to suppose that there is only 
one cycle ( "ergodicity" on the attractor). 

Then consistency between the expansion of the unstable directions and 
the existence of a cyclic permutation of the microcells in the attractor 

12 Here it is essential that the CH holds, otherwise if the system has long time tails the analysis 
becomes much more incolved and so far it can be dealt, even if only qualitatively, on a case by 
case basis. 

13 To get an idea of the orders of magnitude consider a gas of N particles of density p at 

dp 2 dq 2 

temperature T: the metric on phase space will be ds 2 = X^fc^T ^ -2/3 )> ncncc the size 

of a microccll will be \J O(N) <5q if 80 is the precision with which the coordinates arc imagined 
determined (in simulations <5o — 10 -14 in double precision) as all contributions to ds 2 are taken 
of order O(l). Coarse grained cells contain, in all proposals, many particles, O(N), so that their 
size will contain a factor S rather than 5q and will be 5/Sq = 0(N 1 ^ 3 ) larger. 

14 With extreme care it is sometimes, and in equilibrium, possible to represent evolution with 
a code which is a true permutation: the only example that I know, dealing with a physically 
relevant model, is in [79] . 
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A demands that the number of microcells in each coarse grained cell E^, 
Eq.(14-3), must be inversely proportional to the expansion rate, i.e. it has 
to be given by the first of Eq. (14.7). 



More precisely we imagine, developing a heuristic argument, that the 
attractor in each coarse cell £ (£) will appear as a stack of a few portions 
of unstable manifolds, the "layers" of footnote 9 , whose union form the (dis- 
connected) surface A(£) intersection between £ (£) and the attractor. Below 
A(£) will be used to denote both the set and its surface, as the context de- 
mands. The stack of connected surfaces A(£) is imagined covered uniformly 
by N(£) microcells, see Fig.4. 

def 

Let t = T + l. Transitivity implies that there is a smallest integer m > 
such that S t+m £(£) intersects all other £ (£'): the integer m is t-independent 
(and equal to the minimum m such that Q™^ > 0). In t + m time steps 
each coarse cell will have visitied all the others and the layers describing 
the approximate attractor in a single coarse cell will have been expanded 
to cover the entire attractor for the map S t+m . 15 The latter coincides with 
the attractor for S because S 3 is transitive for all j if it is such for j = 1 
and this property has to be reflected by the discretized dynamics at least if 
j is very small compared to the (enormous) recurrence time on the discrete 
attractor as t is, being a time on the coarse grain scale. 

Suppose first that m = 0, hence £*A(£) is the entire attractor for all £. 
This is an assumption useful to exhibit the idea but unrealistic for invertible 
maps: basically this is realized in the closely related SRB theory for a class 
of non invertible expansive maps of the unit interval) . 

15 To see this it is convenient to remark that the S*+ m -image of a layer 5(x) C A(£) of the 
attractor will cover some of the layers of A(£), because S t £(£) visits and fully covers all coarse 
cells £(£'), see footnote 9 . Hence S* +m A(4) will fully cover at least part of the layers of the 
attractor in £ (£) . Actually it will cover the whole of A (£) , because if a layer of A (£) was left out 
then it will be left out of all the iterates of S t+m and a nontrivial invariant subset of the attractor 
for 5* would exist. 




Fig.4: A very schematic and idealized drawing of the attractor layers A(£), remaining after a 
transient time, inside a coarse cell the second drawing (indicated by the arrow) represents 

schematically what the layers really are, if looked closely: namely collections of microcells laying 
uniformly on the attractor layers, i.e. the discretized attractor intersected with the coarse cell. 
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So the density of microcells will be = -^M and under time evolution 
S f the unstable layers A(£') in £(£) expand and cover all the layers in the 
cells £ (£'). If the coarse cell £ (£) is visited, in t = T + 1 time steps, by 
points in the coarse cells a property that will be symbolically denoted 
£g G S~ t £(£), a fraction i/^/ of the N(£') microcells will end in the coarse 
cell £(£), and = 1- Then consistency with evolution as a cyclic 

permutation demands 

^) = E^^m= f C(N m , ,e. (15.1) 

because the density of the microcells on the images of A(£') decreases by 
the expansion factor e A "- T ^'\ so that u^i = a(^) a Tti 7 ) • 

As a side remark it is interesting to point out that for the density p(£) 
Eq.(15.1) becomes simply p(£) = e~ Au > T ^'^ p(£'), closely related to the 
similar equation for invariant densities of Markovian surjectiive maps of the 
unit interval, [28]. 

The matrix C has all elements > (because m = 0) and therefore has 
a simple eigenvector v with positive components to which corresponds the 
eigenvalue A with maximum modulus: v = \C{v) (the "Perron-Frobenius 
theorem") with A = 1 (because v £,t' = !)• It follows that the consistency 
requirement uniquely determines N{£) as proportional to v$. Furthermore 
S t A(^) is the entire attractor; then its surface is £ independent and equal 
to e Au ' T ^A(£): therefore iV(£) = const e - A «,r(€). 

The general case is discussed by considering S t+m instead of S 1 *: this 
requires taking advantage of the properties of the ratios e A "> T ^/e Au ' T + m ^. 
Which are not only uniformly bounded in T but also only dependent on the 
sequence £ = (£_i T , . . . , Clr) through a few symbols with labels near —\T 

and \T: this correction can be considered part of the factors n in the 
rigorous formula Eq.(14.7). 

Note that e Au ' T ^A(£) = constant reflects Pesin's formula, [28], for the 
approximate dynamics considered here. 

So the SRB distribution arises naturally from assuming that dynamics 
can be discretized on a regular array of point ("microcells") and become a 
one cycle permutation of the microcells on the attractor. This is so under 
the CH and holds whether the dynamics is conservative (Hamiltonian) or 
dissipative. 

Remark: It is well known that hyperbolic systems admit (uncountably) 
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many invariant probability distributions, besides the SRB. This can be seen 
by noting that the space of the configurations is identified with a space of 
compatible sequences. On such a space one can define uncountably many 
stochastic processes, for instance by assigning an arbitrary short range trans- 
lation invariant potential, and regarding the corresponding Gibbs state as a 
probability distribution on phase space. However the analysis just presented 
apparently singles out SRB as the unique invariant distribution. This is due 
to our assumption that, in the discretization, microcells are regularly spaced 
and centered on a regular discrete lattice and evolution eventually permutes 
them in a (single, by transitivity) cycle consisting of the microcells located 
on the attractor (and therefore locally evenly spaced, as inherited from the 
regularity of the phase space discretization) . 

Other invariant distributions can be obtained by custom made discretiza- 
tions of phase space which will not cover the attractor in a regular way. 
This is what is done when other distributions, "not absolutely continuous 
with respect to the phase space volume", are to be studied in simulations. A 
paradigmatic example is given by the map x — > 3x mod 1: it has an invariant 
distribution attributing zero probability to the points x that, in base 3, lack 
the digit 2: to find it one has to write a program in which data have this 
property and make sure that the round off errors will not destroy it. Almost 
any "naive" code that simulates this dynamics using double precision reals 
represented in base 2 will generate, instead, the corresponding SRB distri- 
bution which is simply the Lebesgue measure on the unit interval (which is 
the Bernoulli process on the symbolic dynamics giving equal probability | 
to each digit). 

The physical representation of the SRB distribution just obbtained, see 
[29, 23], shows that there is no conceptual difference between stationary 
states in equilibrium and out of equilibrium. In both cases, if motions are 
chaotic they are permutations of microcells and the SRB distribution is 
simply equidistribution over the recurrent microcells. In equilibrium this 
gives the Gibbs microcanonical distribution and out of equilibrium it gives 
the SRB distribution (of which the Gibbs one is a very special case). 

The above heuristic argument is an interpretation of the mathematical 
proofs behind the SRB distribution which can be found in [80, 28], (and 
heuristically is a proof in itself). Once Eq. (14.7) is given, the expectation 
values of the observables in the SRB distributions can be formally written as 
sums over suitably small coarse cells and symmetry properties inherited from 
symmetries of the dynamic become transparent. The Fluctuation Theorem 
is a simple consequence of Eq. (14.7), see Appendix A3: however it is 
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conceptually interesting because of the surprising unification of equilibrium 
and nonequilibrium behind it. 

The discrete repesentation, in terms of coarse grain cells and microcells 
leads to the possibility of counting the number J\f of the microcells and 
therefore to define a kind of entropy function: see [21] where the detailed 
analysis of the counting is performed and the difficulties arising in defining 
an entropy function as fee log M are critically examined. 

16 A3: Why does FT hold? 

As mentioned the proof of FT in quite simple, [26]. By the first of Eq. 
(14.5), (14.7) and by the theory of lL)-short range Ising models, see [39] for 
details, the probability that p is in a small interval centered at p compared 
to the probability that it is in the opposite interval is 

p,x r . e -E:C%A¥(S*« i )+B(i ) r) 

f T \p) _ L^t-tpa+T c (16 1) 



Pr(-P) v _-E- T % A 5 , (S fc «i)+B(i,T) 
2—ii—>—pa + T e 

where Ei->po- + r i s sum over the centers «j of the rectangles Ei labeled by 

def 

i = (£-t/2, ■ ■ £1-/2) with the property 

r/2 

a(S k Ki)+B(i,T)~pa + T (16.2) 

fe=-r/2 

where ~ means that the left hand side is contained in a very small interval 
(of size of order O(l), [39], call it b) centered at pcx+r; the B(i, r) is a term of 
order 1 (a boundary term in the language of the Ising model interpretation 
of the SRB distribution): \B(i, r)| < b < +00: and it takes also into account 
the adjustments to be made because of the arbitrariness of the choice of Kj. 16 
Independence on i,r of the bound on B(i,r) reflects smoothness of S and 
elementary properties of short range ID Ising chains, [39]. 

Suppose that the symbolic dynamics has been chosen time reversible, 
i.e. the time reversal map / maps Pj into I Pi = Pj^ for some this 



16 Which is taken here Ki =the center of P^ , but which could equivalently made by 
choosing other points in E^, for instance by continuing the string i — (£_ T / 2 , • • • ,£1-/2) to 
the right and to the left, according to an a priori fixed rule depending only on £ T / 2 and 
£-1-/2 respectively. Thus turning it to a biinfinite compatible string £j which therefore 
fixes a new point k[. 
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is not a restriction as discussed in Appendix Al. Then the above ratio of 
sums can be rewritten as a ratio of sums over the same set of labels, 

() y, -£:/ t 2 /2 A?(S^ W ,t) 

Remark that A"(/x) = — Af(x) (by time reversal symmetry) and that (by 
Eq. (14.3)) J2l=- T/2 ( A i( Sk (. x )) + A i( S ~ k (. x ))) can be written as 

r/2 r/2 

£ (A^(*))+A?(S fe (*))) = ^ a^x) + 5(x,r) (16.4) 

k=-r/2 k=— r/2 

with B(x,t) < b (again by the smoothness of 5), possibly redefining 6. 

Therefore the ratio of corresponding terms in the numerator and denom- 
inator (i.e. terms bearing the same summation label i) is precisely pcr + T up 
to ±36. Hence 

e ra+p-3b < < e Ta + P+3b (16.5) 

so that FT holds for finite r with an error ±^r, infinitesimal as r — > +oo. 
For a detailed discussion of the error bounds see [39]. 

Of course for all this to make sense the value of p must be among those 
which not only are possible but also such that the values close enough to 
possible values are possible. This means that p has to be an internal point 
to an interval of values that contains limit points of linv^-i-oo ^ X^fc=o — & — 
for a set of x's with positive SRB probability: the value p* in FT is the 
supremum among the value of p with this property, [39] (contrary to state- 
ments in the literature this physically obvious remark is explicitly present 
in the original papers: and one should not consider the three contemporary 
references, [26, 39, 40], has having been influenced by the doubts on this 
point raised much later.) 

The assumptions have been: (a) existence of a Markovian partition, i.e. 
the possibility of a well controlled symbolic dynamics representation of the 
motion; (b) smooth evolution S and (c) smooth time reversal symmetry: 
the properties (a),(b) are equivalent to the CH. Of course positivity of a + is 
essential, in spite of contrary statements; if a + = the leading terms would 
come from what has been bounded in the remainder terms and, in any event 
the analysis world be trivial, with or without chaoticity assumptions, [64]. 
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Since Lorenz, [81], symbolic dynamics is employed to represent chaos 
and many simulations make currently use of it; smoothness has always been 
supposed in studying natural phenomena (lack of it being interpreted as a 
sign of breakdown of the theory and of necessity of a more accurate one); 
time reversal is a fundamental symmetry of nature (realized as T or TCP in 
the Physics notations) . Hence in spite of the ease in exhibiting examples of 
systems which are not smooth, not hyperbolic, not time reversal symmetric 
(or any subset thereof) the CH still seems a good guide to understand chaos. 

17 A4: Harmonic Thermostats 

Here the "efficiency" of a harmonic thermostat is discussed. It turns out 
that in general a thermostat consisting of infinite free systems is a very 
simple kind of Hamiltonian thermostat, but it has to be considered with 
caution as it can be inefficient in the sense that it might not drive a system 
towards equilibrium (i.e. towards a Gibbs distribution). In the example 
given below a system in interaction with an infinite harmonic reservoir at 
inverse temperature (5 is considered. It is shown that the interaction can lead 
to a stationary state, of the system plus reservoir, which is not the Gibbs 
state at temperature (3~ 1 . The following is a repetition of the analysis in 
[18], adapting it to the situation considered here. 

A simple model is a 1-dimensional harmonic oscillators chain, of bosons 
or fermions, initially in a Gibbs state at temperature The Hamiltonian 
for the equilibrium initial state will be 

N-l *2 N-l 2 N 2 

H ° = E -^ A ^ + E + E - <fa-i) 2 (i7.i) 

x=l x=l x=l 

with boundary conditions <7o = <Zjv = and h, m, oJ 2 ,p 2 > 0. The initial state 
will be supposed to have a density matrix po = ^ e -3g u ■ Time evolution 
will be governed by a different Hamiltonian 

H x = H + ^q 2 , A + u; 2 >0 (17.2) 

The question of "thermostat efficiency" is: does pt d = e~a tHx pQe~~a tHx con- 
verge as t —> +oo to poo = r^ ve -(3H x ■ Or: does the system consisting in the os- 
cillators labeled 2,3,... succeed in bringing up to the new equilibrium state 
the oscillator labeled 1? Convergence means that the limit (A) t _> +a J 
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(A) exists, at least for the observables A essentially localized in a finite 
region. 

The Hamiltonian in Eq.(17.2) can be diagonalized by studying the matrix 



V\ = m 



/lo 2 + 2/r + A -ii 1 ...\ 

-fj 2 uj 2 + 2/i 2 -fi 2 

-ii 2 w 2 + 2/z 2 .. 



V + \mPi (17.3) 



V ... / 

The normalized eigenstates and respective eigenvalues of Vq are 



= f V ^ sm fx, A° = m (u, 2 + 2 M 2 (1 - cos f )) (17.4) 

and the vectors ^ will be also denoted \k) or |^°). 

To solve the characteristic equation for V^, call ^ a generic normalized 
eigenvector with eigenvalue A; the eigenvalue equation is 

(fc|tf)(Ag-A) + Am(fc|rt)<fl|tf) =0 (17.5) 

where ft is the vector ft = (1, 0, . . . , 0) € C^ -1 , so that Pi = |ft)(ft|. Hence, 
noting that (ft I*) cannot be because this would imply that A = A^ for 
some k and therefore |*) = \k) which contradicts (ft|*) = 0, it is 

(17.0) 

and the compatibitity condition that has to be satisfied is 

Xm -^a_ao^-^ N A-A0- ^-7) 

Once Eq.(17.7) is satisfied, Eq.(17.6) imply that the eigenvalue equation, 
Eq.(17.5), is satisfied, and by a |^) / (determined up to a factor). 

The Eq.(17.7) has iV— 1 solutions, corresponding to the N— 1 eigenvalues 
of V\. This follows by comparing the graph of y(A) = with the graph of 
the function of A in the intermediate term of Eq.(17.7). One of the solutions 
remains isolated in the limit N — > oo, because the equation 



2 Am f w sin 2 n , n . d e / / 2 2 . 2 k\ 

1 = — / o -^ao^)^ AV)=^(. 2 + 4 M 2 sm 2 -) (17i 
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has, uniformly in N, only one isolated solution for A < inf A°(/c) = muo 2 if 
A < 0, or for A > sup A (re) if A < 0. Suppose for defmiteness that A < 0. 

Let tyu(x), k = 1, . . . ,N — 1, be the corresponding eigenfunctions. The 
matrices U\-k, x = ^k( x ) are unitary and (U\)\=o = Uq. It is Uo-k, x = 
sin^x and <*°|M/£,) = Zjv( ^_ A o) with Z N (k'f = £ fc by 
Eq.(17.6). Then setting a± = let 

4;k =Va« + )*, «A; fc = d =(«-^) fe (17.9) 

where £/* is the adjoint of U (so that UU* = 1 if U is unitary). It is 

<4 = S^A;fc,x°ifc' °A;fc = H U \ik,yUo;h,y<4 th (17.10) 

if the overbars denote complex conjugation. 

The operators a^ k will be creation and annihilation operators for quanta 

with energy %\J~^- d = E\{k). So a state with n& = 0, 1, . . . quanta in state 
fc will have energy £ fe E\{k){n k + ^). 

Consider the observable ati a \i = ^- Its average is time independent, 

de f 

in the evolution generated by H\, and if W = U\Uq it is equal to 



( A ) Pt = ( A ) P0 = TV Po (Wa+) 1 (Wa -) 1 
= ^Tr PoW hk W hk/ a+ k a^ k , = ^ |Wi )fc 



^\ T „ l2 E^ e-^ (fc)n n ( 17 - n ) 



fc=i 



y- n / e -f3E (k)n 



where n/ = 1 if the statistics of the quanta is fermionic (this was the case 
in [18]) or nf = +oo if it is bosonic. In the two cases the result is 

If the system reached thermal equilibrium, setting p\(k) d = e/3 E x \k) ±1 i t ms 
should be p\(l), which is impossible, as it can be checked by letting f3 — > +oo 
and remarking that it is E\(l) < Eq(1) with a difference positive uniformly 
in N. Furthermore the observable A is localized near the site x = 1: because 
the wave function of the lowest eigenvalue is ^(l) a^a* l^fe) so * nat 

*^(l)^(x) 1 2 / >7r sin re sin ra 



* 1(X) "^(1)^ A*-A° ^ Zoo Wo A^-A°(ref K (17 ' 13) 
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and the integral tends to as x — ► oo faster than any power, so that < 
Zoo < °o and ^ is normalizable. 

Therefore the thermostatic action of the system in the sites 2,3,... on 
the site 1 is not efficient and the state does not evolve towards the Gibbs 
state at temperature not even in the limit N — > +oo. 

This result should be contrasted with the closely related case in which 
the system oscillator at 1 plus the others is started in a equilibrium state for 
H\ and at time is evolved with Hamiltonian Hq. In this case the system 
thermalizes properly, see the analogous analysis in [18], see also [14] for a 
large class of related examples. 

Of course the question of effectiveness of a thermostat could be discussed 
also for non linear theormostats, finite or infinite. It seems that, under mild 
assumptions, non linear thermostat models should be efficient, i.e. generate 
proper heat exchanges even when acting only at the boundary as in the 
case of the thermostats considered in Sec. 9. The analysis in [82] gives some 
preliminary evidence in this direction. 

Harmonic thermostats are nevertheless very interesting, provided the 
above pathologies are excluded by a careful formulation of the models: see 
for instance [14], see also [17]. It is also clear that the pathologies seem to be 
related to the fact that the thermostats constituents are "not interacting" 
or "linearly interacting": their origin in the above analysis is shown to be 
related to the existence of isolated eigenvalues of the Hamiltonian at the 
bottom of the spectrum and this is the property that should be excluded. 
The pathologies are likely to be absent in models in which there is nonlinear 
interaction within the thermostats constituents so that such models should 
be perfectly well behavng (i.e. efficient in the sense of this paper). However 
the latter models are also highly nontrivial even at a purely mathematical 
level. 

18 A5: Bohmian Quantum Systems 

Consider the system in Fig.l and suppose, as in Sec. 10, that the nonconser- 
vative force E(Xo) acting on the system vanishes, i.e. consider the problem 
of heat flow through Co- Let H be the operator on L<2(Cq No ), space of sym- 
metric or antisymmetric wave functions 

fix = -^-A Xo + Uq(X ) + E (U 0j (X , Xj) + UjiXj) + Kj) (18.1) 
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where Ax is the Laplacian, and note that its spectrum consists of eigen- 

def 

values E n = E n ({X.j}j > o), depending on the configuration X = {Xj}j>o, 

Thermostats will be modeled as assemblies of classical particles as in 
Sec. 9: thus their temperature can be defined as the average kinetic energy 
of their particles and the question of how to define it does not arise. 

The viewpoint of Bohm on quantum theory seems quite well adapted to 
the kind of systems considered here. A system-reservoirs model can be the 
dynamical system on the variables (^,Xq, ({Xj}, {Xj})j>o) defined by 



-in*(Xo)=(flx*)(Xo), 

X =Mm 9X »*J X0) , and for j > 
= - (^(X,) + djUjiXo, Xjj) - aj±j 



(18.2) 



def Wj ~ Uj tx r de_f 

2K 4 



here the first equation is Schrddinger's equation, the second is the vlocity 
of the Bohmian particles carried by the wave the others are equations of 
motion for the thermostats particles analogous to the one in Eq.(9.1), (whose 
notation for the particles labels is adopted here too). Evolution maintains 
the thermostats kinetic energies Kj = ^X^ exactly constant so that they 
will be used to define the thermostats temperatures Tj via Kj = ^ksTjNj, 
as in the classical case. 

Note that if there is no coupling between system and thermostats, i.e. 
the system is "isolated", then there are many invariant distributions: e.g. 
the probability distributions [i proportional to 



oo 

e- p ° En 5{^> - e i{pn ) |f (Xo)| 2 d9? n dX II 5 ( X i ~ ZK^dXjdXj (18.3) 

71=1 j 

where E n and ^f n are time independent, under the assumed absence of in- 
teraction between system and thermostats, and are the eigenvalues and the 
correspoding eigenvectors of H. Then the distributions fi are invariant under 
the time evolution. 

Time invariance of this kind of distributions is discussed in [83, Sec. 4], 
where it appears as an instance of what is called there a "quantum equilib- 
rium". The average value of an observable 0(X.q), which depends only on 
position Xq, will be the "usual" Gibbs average 
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<0> /4 = Z- i yTr(e-A» H 0)) (18.4) 

For studying nonequilibrium stationary states consider several thermo- 
stats with interaction energy with Co, Wj(Xo,X.j), as in Eq. (9.1). The 
equations of motion should be Eq. (18.2) 

In general solutions of Eq.(18.2) will not be quasi periodic and the Chaotic 
Hypothesis, [40, 23, 58], can be assumed: if so the dynamics should select 
an invariant distribution fi. The distribution n will give the statistical prop- 
erties of the stationary states reached starting the motion in a thermostat 
configuration (Xj,Xj)j>o, randomly chosen with "uniform distribution" v 
on the spheres mX^ = SNjksTj and in a random eigenstate of H. The 
distribution /x, if existing and unique, could be named the SRB distribution 
corresponding to the chaotic motions of Eq.(18.2). 

In the case of a system interacting with a single thermostat the latter 
distribution should be equivalent to the canonical distribution. As in Sec. 11 
an important consistency check for the model just proposed in Eq.(18.2) is 
that there should exist at least one stationary distribution fi equivalent to 
the canonical distribution at the appropriate temperature T\ associated with 
the (constant) kinetic energy of the thermostat: K\ = §&bTi N±. However 
also in this already in Sec.ll, it does not seem possible to define a 

simple invariant distribution, not even in the adiabatic approximation. As 
in Sec.ll, equivalence between fx and a Gibbs distribution at temperature 
T\ can only be conjectured. 

Furthermore it is not clear how to define phase space contraction, hence 
how to formulate a FT, although the equations are reversible. 
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